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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Behan, Dominic P. 

Lehmarm-Bruinsma, Karin 
Chalmers, Derek T. 
Lowitz, Kevin P. 
Lin, I-Lin 
Dang, Huong T. 
Chen, Ruoping 
Liaw, Chen W. 
Gore, Martin J. 
White, Carol 

:ii) TITLE OF INVENTION: Non - Endogenous , Cons ti tut ively Activated Human 

G Protein- Coupled Receptors 

(iii) NUMBER OF SEQUENCES: 146 



(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Arena Pharmaceuticals, Inc. 

(B) STREET: 6166 Nancy Ridge Drive 

(C) CITY: San Diego 

(D) STATE: CA 

(E) COUNTRY: USA 

(F) ZIP: 92121 



(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS -DOS 

(D) SOFTWARE: Patent In Release #1.0, Version #1.3 0 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 

(B) FILING DATE: 

(C) CLASSIFICATION: 



(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: Burgoon, Richard P. 

(B) REGISTRATION NUMBER: 34,787 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: ( 858 ) 453 -7200 

(B) TELEFAX: (858)453-7210 



(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1260 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

ATGGTCTTCT CGGCAGTGTT GACTGCGTTC CATACCGGGA CATCCAACAC AACATTTGTC 60 

5 GTGTATGAAA ACACCTACAT GAATATTACA CTCCCTCCAC CATTCCAGCA TCCTGACCTC 120 

AGTCCATTGC TTAGATATAG TTTTGAAACC ATGGCTCCCA CTGGTTTGAG TTCCTTGACC 180 

GTGAATAGTA CAGCTGTGCC CACAACACCA GCAGCATTTA AGAGCCTAAA CTTGCCTCTT 24 0 

CAGATCACCC TTTCTGCTAT AATGATATTC ATTCTGTTTG TGTCTTTTCT TGGGAACTTG 300 

GTTGTTTGCC TCATGGTTTA CCAAAAAGCT GCCATGAGGT CTGCAATTAA CATCCTCCTT 360 

10 GCCAGCCTAG CTTTTGCAGA CATGTTGCTT GCAGTGCTGA ACATGCCCTT TGCCCTGGTA 420 

ACTATTCTTA CTACCCGATG GATTTTTGGG AAATTCTTCT GTAGGGTATC TGCTATGTTT 480 

TTCTGGTTAT TTGTGATAGA AGGAGTAGCC ATCCTGCTCA TCATTAGCAT AGATAGGTTC ' 54 0 

CTTATTATAG TCCAGAGGCA GGATAAGCTA AACCCATATA GAGCTAAGGT TCTGATTGCA 600 

GTTTCTTGGG CAACTTCCTT TTGTGTAGCT TTTCCTTTAG CCGTAGGAAA CCCCGACCTG 660 

15 CAGATACCTT CCCGAGCTCC CCAGTGTGTG TTTGGGTACA CAACCAATCC AGGCTACCAG 72 0 

GCTTATGTGA TTTTGATTTC TCTCATTTCT TTCTTCATAC CCTTCCTGGT AATACTGTAC 78 0 

TCATTTATGG GCATACTCAA CACCCTTCGG CACAATGCCT TGAGGATCCA TAGCTACCCT 840 

GAAGGTATAT GCCTCAGCCA GGCCAGCAAA CTGGGTCTCA TGAGTCTGCA GAGACCTTTC 900 

CAGATGAGCA TTGACATGGG CTTTAAAACA CGTGCCTTCA CCACTATTTT GATTCTCTTT 960 

20 GCTGTCTTCA TTGTCTGCTG GGCCCCATTC ACCACTTACA GCCTTGTGGC AACATTCAGT 1020 

AAGCACTTTT ACTATCAGCA CAACTTTTTT GAGATTAGCA CCTGGCTACT GTGGCTCTGC 1080 

TACCTCAAGT CTGCATTGAA TCCGCTGATC TACTACTGGA GGATTAAGAA ATTCCATGAT 1140 

GCTTGCCTGG ACATGATGCC TAAGTCCTTC AAGTTTTTGC CGCAGCTCCC TGGTCACACA 1200 

AAGCGACGGA TACGTCCTAG TGCTGTCTAT GTGTGTGGGG AACATCGGAC GGTGGTGTGA 1260 
25 (3) INFORMATION FOR SEQ ID N0:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 419 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
30 (D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Met Val Phe Ser Ala Val Leu Thr Ala Phe His Thr Gly Thr Ser Asn 
15 10 15 

5 Thr Thr Phe Val Val Tyr Glu Asn Thr Tyr Met Asn He Thr Leu Pro 

20 25 30 

Pro Pro Phe Gin His Pro Asp Leu Ser Pro Leu Leu Arg Tyr Ser Phe 
35 40 45 

Glu Thr Met Ala Pro Thr Gly Leu Ser Ser Leu Thr Val Asn Ser Thr 
1^ 50 55 go 

Ala Val Pro Thr Thr Pro Ala Ala Phe Lys Ser Leu Asn Leu Pro Leu 

70 75 80 

Gin He Thr Leu Ser Ala He Met He Phe He Leu Phe Val Ser Phe 
85 90 95 

15 Leu Gly Asn Leu Val Val Cys Leu Met Val Tyr Gin Lys Ala Ala Met 

100 105 110 

Arg Ser Ala He Asn He Leu Leu Ala Ser Leu Ala Phe Ala Asp Met 
115 120 125 

Leu Leu Ala Val Leu Asn Met Pro Phe Ala Leu Val Thr He Leu Thr 
20 130 135 140 

Thr Arg Trp He Phe Gly Lys Phe Phe Cys Arg Val Ser Ala Met Phe 

150 155 160 

Phe Trp Leu Phe Val He Glu Gly Val Ala He Leu Leu He He Ser 
165 170 175 

25 He Asp Arg Phe Leu He He Val Gin Arg Gin Asp Lys Leu Asn Pro 

180 185 190 

Tyr Arg Ala Lys Val Leu He Ala Val Ser Trp Ala Thr Ser Phe Cys 
195 200 205 

Val Ala Phe Pro Leu Ala Val Gly Asn Pro Asp Leu Gin He Pro Ser 
30 210 215 220 

Arg Ala Pro Gin Cys Val Phe Gly Tyr Thr Thr Asn Pro Gly Tyr Gin 
225 230 235 240 

Ala Tyr Val He Leu He Ser Leu He Ser Phe Phe He Pro Phe Leu 
245 250 255 



35 



Val He Leu Tyr Ser Phe Met Gly He Leu Asn Thr Leu Arg His Asn 
260 265 270 



m 

IT' 

III 
m 
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Ala Leu Arg He His Ser Tyr Pro Glu Gly He Cys Leu Ser Gin Ala 
275 280 285 

Ser Lys Leu Gly Leu Met Ser Leu Gin Arg Pro Phe Gin Met Ser He 
290 295 300 

5 Asp Met Gly Phe Lys Thr Arg Ala Phe Thr Thr He Leu He Leu Phe 

310 315 320 

Ala Val Phe He Val Cys Trp Ala Pro Phe Thr Thr Tyr Ser Leu Val 
325 330 335 

Ala Thr Phe Ser Lys His Phe Tyr Tyr Gin His Asn Phe Phe Glu He 
10 340 .345 350 

Ser Thr Trp Leu Leu Trp Leu Cys Tyr Leu Lys Ser Ala Leu Asn Pro 
355 360 365 

Leu He Tyr Tyr Trp Arg He Lys Lys Phe His Asp Ala Cys Leu Asp 
370 375 380 



15 



20 



Met Met Pro Lys Ser Phe Lys Phe Leu Pro Gin Leu Pro Gly His Thr 
385 390 395 400 

Lys Arg Arg He Arg Pro Ser Ala Val Tyr Val Cys Gly Glu His Arg 
405 410 415 

Thr Val Val 



(4) INFORMATION FOR SEQ ID NO : 3 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1119 base pairs 

(B) TYPE: nucleic acid 
25 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

ATGTTAGCCA ACAGCTCCTC AACCAACAGT TCTGTTCTCC CGTGTCCTGA CTACCGACCT 60 

30 ACCCACCGCC TGCACTTGGT GGTCTACAGC TTGGTGCTGG CTGCCGGGCT CCCCCTCAAC 12 0 

GCGCTAGCCC TCTGGGTCTT CCTGCGCGCG CTGCGCGTGC ACTCGGTGGT GAGCGTGTAC 180 

ATGTGTAACC TGGCGGCCAG CGACCTGCTC TTCACCCTCT CGCTGCCCGT TCGTCTCTCC 24 0 

TACTACGCAC TGCACCACTG GCCCTTCCCC GACCTCCTGT GCCAGACGAC GGGCGCCATC 3 00 

TTCCAGATGA ACATGTACGG CAGCTGCATC TTCCTGATGC TCATCAACGT GGACCGCTAC 3 60 
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GCCGCCATCG 


TGCACCCGCT 


GCGACTGCGC 


CACCTGCGGC 


GGCCCCGCGT 


GGCGCGGCTG 


420 


CTCTGCCTGG 


GCGTGTGGGC 


GCTCATCCTG 


GTGTTTGCCG 


TGCCCGCCGC 


CCGCGTGCAC 


480 


AGGCCCTCGC 


GTTGCCGCTA 


CCGGGACCTC 


GAGGTGCGCC 


TATGCTTCGA 


GAGCTTCAGC 


540 


GACGAGCTGT 


GGAAAGGCAG 


GCTGCTGCCC 


CTCGTGCTGC 


TGGCCGAGGC 


GCTGGGCTTC 


600 


CTGCTGCCCC 


TGGCGGCGGT 


GGTCTACTCG 


TCGGGCCGAG 


TCTTCTGGAC 


GCTGGCGCGC 


660 


CCCGACGCCA 


CGCAGAGCCA 


GCGGCGGCGG 


AAGACCGTGC 


GCCTCCTGCT 


GGCTAACCTC 


720 


GTCATCTTCC 


TGCTGTGCTT 


CGTGCCCTAC 


AACAGCACGC 


TGGCGGTCTA 


CGGGCTGCTG 


780 


CGGAGCAAGC 


TGGTGGCGGC 


CAGCGTGCCT 


GCCCGCGATC 


GCGTGCGCGG 


GGTGCTGATG 


840 


GTGATGGTGC 


TGCTGGCCGG 


CGCCAACTGC 


GTGCTGGACC 


CGCTGGTGTA 


CTACTTTAGC 


900 


GCCGAGGGCT 


TCCGCAACAC 


CCTGCGCGGC 




^^^^Ti ^^^^^^ 

CGCACCGGGC 


CAGGACCTCG 


960 


GCCACCAACG 


GGACGCGGGC 


GGCGCTCGCG 


CAATCCGAAA 


GGTCCGCCGT 


CACCACCGAC 


1020 


GCCACCAGGC 


CGGATGCCGC 


CAGTCAGGGG 


CTGCTCCGAC 


CCTCCGACTC 


CCACTCTCTG 


1080 


TCTTCCTTCA 


CACAGTGTCC 


CCAGGATTCC 


GCCCTCTGA 






1119 


(5) INFORMATION FOR SEQ ID NO: 4: 











15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 372 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: not relevant 

20 



(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

Met Leu Ala Asn Ser Ser Ser Thr Asn Ser Ser Val Leu Pro Cys Pro 
15 10 15 

Asp Tyr Arg Pro Thr His Arg Leu His Leu Val Val Tyr Ser Leu Val 
2^ 20 25 30 

Leu Ala Ala Gly Leu Pro Leu Asn Ala Leu Ala Leu Trp Val Phe Leu 
35 40 45 

Arg Ala Leu Arg Val His Ser Val Val Ser Val Tyr Met Cys Asn Leu 
50 55 60 



30 



Ala Ala Ser Asp Leu Leu Phe Thr Leu Ser Leu Pro Val Arg Leu Ser 
^5 70 75 80 

Tyr Tyr Ala Leu His His Trp Pro Phe Pro Asp Leu Leu Cys Gin Thr 



f|3 

in 



1^ 
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85 90 95 

Thr Gly Ala He Phe Gin Met Asn Met Tyr Gly Ser Cys He Phe Leu 

105 110 

Met Leu He Asn Val Asp Arg Tyr Ala Ala He Val His Pro Leu Arg 
^ 115 120 125 

Leu Arg His Leu Arg Arg Pro Arg Val Ala Arg Leu Leu Cys Leu Gly 
130 135 140 

Val Trp Ala Leu He Leu Val Phe Ala Val Pro Ala Ala Arg Val His 
1^5 150 155 160 

Arg Pro Ser Arg Cys Arg Tyr Arg Asp Leu Glu Val Arg Leu Cys Phe 
165 170 175 

Glu Ser Phe Ser Asp Glu Leu Trp Lys Gly Arg Leu Leu Pro Leu Val 
180 185 190 



10 



Leu Leu Ala Glu Ala Leu Gly Phe Leu Leu Pro Leu Ala Ala Val Val 

.2 15 195 200 205 



20 



Tyr Ser Ser Gly Arg Val Phe Trp Thr Leu Ala Arg Pro Asp Ala Thr 
210 215 220 

Gin Ser Gin Arg Arg Arg Lys Thr Val Arg Leu Leu Leu Ala Asn Leu 
225 230 235 240 

Val He Phe Leu Leu Cys Phe Val Pro Tyr Asn Ser Thr Leu Ala Val 
245 250 255 

Tyr Gly Leu Leu Arg Ser Lys Leu Val Ala Ala Ser Val Pro Ala Arg 
260 265 270 

Asp Arg Val Arg Gly Val Leu Met Val Met Val Leu Leu Ala Gly Ala 
25 275 280 285 

Asn Cys Val Leu Asp Pro Leu Val Tyr Tyr Phe Ser Ala Glu Gly Phe 
290 295 300 

Arg Asn Thr Leu Arg Gly Leu Gly Thr Pro His Arg Ala Arg Thr Ser 
305 310 315 320 

30 Ala Thr Asn Gly Thr Arg Ala Ala Leu Ala Gin Ser Glu Arg Ser Ala 

325 330 335 

Val Thr Thr Asp Ala Thr Arg Pro Asp Ala Ala Ser Gin Gly Leu Leu 
340 345 350 

Arg Pro Ser Asp Ser His Ser Leu Ser Ser Phe Thr Gin Cys Pro Gin 
35 355 360 365 



Asp Ser Ala Leu 
370 
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^0 

IP 
lU 

in 
ill 

i 

IP 



(6) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1107 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

ATGGCCAACT CCACAGGGCT GAACGCCTCA GAAGTCGCAG GCTCGTTGGG GTTGATCCTG 60 

GCAGCTGTCG TGGAGGTGGG GGCACTGCTG GGCAACGGCG CGCTGCTGGT CGTGGTGCTG 120 

CGCACGCCGG GACTGCGCGA CGCGCTCTAC CTGGCGCACC TGTGCGTCGT GGACCTGCTG 180 

GCGGCCGCCT CCATCATGCC GCTGGGCCTG CTGGCCGCAC CGCCGCCCGG GCTGGGCCGC 24 0 

GTGCGCCTGG GCCCCGCGCC ATGCCGCGCC GCTCGCTTCC TCTCCGCCGC TCTGCTGCCG : 3 00 

GCCTGCACGC TCGGGGTGGC CGCACTTGGC CTGGCACGCT ACCGCCTCAT CGTGCACCCG 360 

CTGCGGCCAG GCTCGCGGCC GCCGCCTGTG CTCGTGCTCA CCGCCGTGTG GGCCGCGGCG 420 

GGACTGCTGG GCGCGCTCTC CCTGCTCGGC CCGCCGCCCG CACCGCCCCC TGCTCCTGCT 4 80 

CGCTGCTCGG TCCTGGCTGG GGGCCTCGGG CCCTTCCGGC CGCTCTGGGC CCTGCTGGCC 540 

TTCGCGCTGC CCGCCCTCCT GCTGCTCGGC GCCTACGGCG GCATCTTCGT GGTGGCGCGT 600 

CGCGCTGCCC TGAGGCCCCC ACGGCCGGCG CGCGGGTCCC GACTCCGCTC GGACTCTCTG 660 

GATAGCCGCC TTTCCATCTT GCCGCCGCTC CGGCCTCGCC TGCCCGGGGG CAAGGCGGCC 72 0 

CTGGCCCCAG CGCTGGCCGT GGGCCAATTT GCAGCCTGCT GGCTGCCTTA TGGCTGCGCG 7 80 

TGCCTGGCGC CCGCAGCGCG GGCCGCGGAA GCCGAAGCGG CTGTCACCTG GGTCGCCTAC 840 

TCGGCCTTCG CGGCTCACCC CTTCCTGTAC GGGCTGCTGC AGCGCCCCGT GCGCTTGGCA 900 

CTGGGCCGCC TCTCTCGCCG TGCACTGCCT GGACCTGTGC GGGCCTGCAC TCCGCAAGCC 960 

TGGCACCCGC GGGCACTCTT GCAATGCCTC CAGAGACCCC CAGAGGGCCC TGCCGTAGGC 102 0 

CCTTCTGAGG CTCCAGAACA GACCCCCGAG TTGGCAGGAG GGCGGAGCCC CGCATACCAG 10 80 

GGGCCACCTG AGAGTTCTCT CTCCTGA 1107 

(7) INFORMATION FOR SEQ ID NO : 6 : 



30 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 368 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: not relevant 



(ii) MOLECULE TYPE: protein 



5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

Met Ala Asn Ser Thr Gly Leu Asn Ala Ser Glu Val Ala Gly Ser Leu 
15 10 15 

Gly Leu lie Leu Ala Ala Val Val Glu Val Gly Ala Leu Leu Gly Asn 
20 25 30 

10 Gly Ala Leu Leu Val Val Val Leu Arg Thr Pro Gly Leu Arg Asp Ala 

35 40 45 

'■^ Leu Tyr Leu Ala His Leu Cys Val Val Asp Leu Leu Ala Ala Ala Ser 

50 55 60 

^ ^: lie Met Pro Leu Gly Leu Leu Ala Ala Pro Pro Pro Gly Leu Gly Arg 

If' 15 65 70 75 80 

iU 

fn Val Arg Leu Gly Pro Ala Pro Cys Arg Ala Ala Arg Phe Leu Ser Ala 

III 85 90 95 

P Ala Leu Leu Pro Ala Cys Thr Leu Gly Val Ala Ala Leu Gly Leu Ala 

ifl 100 105 110 

□ 

'-■rJ 20 Arg Tyr Arg Leu lie Val His Pro Leu Arg Pro Gly Ser Arg Pro Pro 

5^ 115 120 125 



Pro Val Leu Val Leu Thr Ala Val Trp Ala Ala Ala Gly Leu Leu Gly 
130 135 140 

Ala Leu Ser Leu Leu Gly Pro Pro Pro Ala Pro Pro Pro Ala Pro Ala 
25 145 150 155 160 

Arg Cys Ser Val Leu Ala Gly Gly Leu Gly Pro Phe Arg Pro Leu Trp 
165 170 175 

Ala Leu Leu Ala Phe Ala Leu Pro Ala Leu Leu Leu Leu Gly Ala Tyr 
180 185 190 

30 Gly Gly lie Phe Val Val Ala Arg Arg Ala Ala Leu Arg Pro Pro Arg 

195 200 205 

Pro Ala Arg Gly Ser Arg Leu Arg Ser Asp Ser Leu Asp Ser Arg Leu 
210 215 220 

Ser lie Leu Pro Pro Leu Arg Pro Arg Leu Pro Gly Gly Lys Ala Ala 
35 225 230 235 240 



Leu Ala Pro Ala Leu Ala Val Gly Gin Phe Ala Ala Cys Trp Leu Pro 
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245 250 255 

Tyr Gly Cys Ala Cys Leu Ala Pro Ala Ala Arg Ala Ala Glu Ala Glu 
260 265 270 

Ala Ala Val Thr Trp Val Ala Tyr Ser Ala Phe Ala Ala His Pro Phe 
5 275 280 285 

Leu Tyr Gly Leu Leu Gin Arg Pro Val Arg Leu Ala Leu Gly Arg Leu 
290 295 300 

Ser Arg Arg Ala Leu Pro Gly Pro Val Arg Ala Cys Thr Pro Gin Ala 
305 310 315 320 

10 Trp His Pro Arg Ala Leu Leu Gin Cys Leu Gin Arg Pro Pro Glu Gly 

325 330 335 

(?■ 

Pro Ala Val Gly Pro Ser Glu Ala Pro Glu Gin Thr Pro Glu Leu Ala 
340 345 350 

Gly Gly Arg Ser Pro Ala Tyr Gin Gly Pro Pro Glu Ser Ser Leu Ser 
15 355 360 365 

(8) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1008 base pairs 

(B) TYPE: nucleic acid 
20 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

ATGGAATCAT CTTTCTCATT TGGAGTGATC CTTGCTGTCC TGGCCTCCCT CATCATTGCT 6 0 

25 ACTAACACAC TAGTGGCTGT GGCTGTGCTG CTGTTGATCC ACAAGAATGA TGGTGTCAGT 120 

CTCTGCTTCA CCTTGAATCT GGCTGTGGCT GACACCTTGA TTGGTGTGGC CATCTCTGGC 180 

CTACTCACAG ACCAGCTCTC CAGCCCTTCT CGGCCCACAC AGAAGACCCT GTGCAGCCTG 24 0 

CGGATGGCAT TTGTCACTTC CTCCGCAGCT GCCTCTGTCC TCACGGTCAT GCTGATCACC 3 00 

TTTGACAGGT ACCTTGCCAT CAAGCAGCCC TTCCGCTACT TGAAGATCAT GAGTGGGTTC 3 60 

30 GTGGCCGGGG CCTGCATTGC CGGGCTGTGG TTAGTGTCTT ACCTCATTGG CTTCCTCCCA 42 0 

CTCGGAATCC CCATGTTCCA GCAGACTGCC TACAAAGGGC AGTGCAGCTT CTTTGCTGTA 4 80 

TTTCACCCTC ACTTCGTGCT GACCCTCTCC TGCGTTGGCT TCTTCCCAGC CATGCTCCTC 540 

TTTGTCTTCT TCTACTGCGA CATGCTCAAG ATTGCCTCCA TGCACAGCCA GCAGATTCGA 600 
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AAGATGGAAC ATGCAGGAGC CATGGCTGGA GGTTATCGAT CCCCACGGAC TCCCAGCGAC 660 

TTCAAAGCTC TCCGTACTGT GTCTGTTCTC ATTGGGAGCT TTGCTCTATC CTGGACCCCC 720 

TTCCTTATCA CTGGCATTGT GCAGGTGGCC TGCCAGGAGT GTCACCTCTA CCTAGTGCTG 780 

GAACGGTACC TGTGGCTGCT CGGCGTGGGC AACTCCCTGC TCAACCCACT CATCTATGCC 84 0 

5 TATTGGCAGA AGGAGGTGCG ACTGCAGCTC TACCACATGG CCCTAGGAGT GAAGAAGGTG 900 

CTCACCTCAT TCCTCCTCTT TCTCTCGGCC AGGAATTGTG GCCCAGAGAG GCCCAGGGAA 960 

AGTTCCTGTC ACATCGTCAC TATCTCCAGC TCAGAGTTTG ATGGCTAA ioo8 

(9) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 335 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

Met Glu Ser Ser Phe Ser Phe Gly Val He Leu Ala Val Leu Ala Ser 
15 10 15 

Leu He He Ala Thr Asn Thr Leu Val Ala Val Ala Val Leu Leu Leu 
20 25 30 

20 He His Lys Asn Asp Gly Val Ser Leu Cys Phe Thr Leu Asn Leu Ala 

35 40 45 

Val Ala Asp Thr Leu He Gly Val Ala He Ser Gly Leu Leu Thr Asp 
50 55 60 

Gin Leu Ser Ser Pro Ser Arg Pro Thr Gin Lys Thr Leu Cys Ser Leu 
25 65 70 75 80 

Arg Met Ala Phe Val Thr Ser Ser Ala Ala Ala Ser Val Leu Thr Val 
85 90 95 

Met Leu He Thr Phe Asp Arg Tyr Leu Ala He Lys Gin Pro Phe Arg 
100 105 110 



30 



Tyr Leu Lys He Met Ser Gly Phe Val Ala Gly Ala Cys He Ala Gly 
115 120 125 

Leu Trp Leu Val Ser Tyr Leu He Gly Phe Leu Pro Leu Gly He Pro 
130 135 140 

Met Phe Gin Gin Thr Ala Tyr Lys Gly Gin Cys Ser Phe Phe Ala Val 
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145 150 155 160 

Phe His Pro His Phe Val Leu Thr Leu Ser Cys Val Gly Phe Phe Pro 
165 170 175 

Ala Met Leu Leu Phe Val Phe Phe Tyr Cys Asp Met Leu Lys lie Ala 
5 180 185 190 

Ser Met His Ser Gin Gin He Arg Lys Met Glu His Ala Gly Ala Met 
195 200 205 

Ala Gly Gly Tyr Arg Ser Pro Arg Thr Pro Ser Asp Phe Lys Ala Leu 
210 215 220 

10 Arg Thr Val Ser Val Leu He Gly Ser Phe Ala Leu Ser Trp Thr Pro 

225 230 235 240 

Phe Leu lie Thr Gly He Val Gin Val Ala Cys Gin Glu Cys His Leu 
245 250 255 

Tyr Leu Val Leu Glu Arg Tyr Leu Trp Leu Leu Gly Val Gly Asn Ser 
15 260 265 270 

In 

rU Leu Leu Asn Pro Leu He Tyr Ala Tyr Trp Gin Lys Glu Val Arg Leu 

in 275 " 280 285 

3 y 

3 Gin Leu Tyr His Met Ala Leu Gly Val Lys Lys Val Leu Thr Ser Phe 

=3 290 295 300 



Q 20 Leu Leu Phe Leu Ser Ala Arg Asn Cys Gly Pro Glu Arg Pro Arg Glu 

Q 305 310 315 320 

l2 Ser Ser Cys His He Val Thr He Ser Ser Ser Glu Phe Asp Gly 

" 325 330 335 

(10) INFORMATION FOR SEQ ID NO : 9 : 

25 ( i ) SEQXXENCE CHARACTERISTICS : 

(A) LENGTH: 1413 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

30 (ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

ATGGACACTA CCATGGAAGC TGACCTGGGT GCCACTGGCC ACAGGCCCCG CACAGAGCTT 60 

GATGATGAGG ACTCCTACCC CCAAGGTGGC TGGGACACGG TCTTCCTGGT GGCCCTGCTG 12 0 

CTCCTTGGGC TGCCAGCCAA TGGGTTGATG GCGTGGCTGG CCGGCTCCCA GGCCCGGCAT 180 

35 GGAGCTGGCA CGCGTCTGGC GCTGCTCCTG CTCAGCCTGG CCCTCTCTGA CTTCTTGTTC 240 
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CTGGCAGCAG CGGCCTTCCA GATCCTAGAG ATCCGGCATG GGGGACACTG GCCGCTGGGG 300 

ACAGCTGCCT GCCGCTTCTA CTACTTCCTA TGGGGCGTGT CCTACTCCTC CGGCCTCTTC 360 

CTGCTGGCCG CCCTCAGCCT CGACCGCTGC CTGCTGGCGC TGTGCCCACA CTGGTACCCT 420 

GGGCACCGCC CAGTCCGCCT GCCCCTCTGG GTCTGCGCCG GTGTCTGGGT GCTGGCCACA 480 

5 CTCTTCAGCG TGCCCTGGCT GGTCTTCCCC GAGGCTGCCG TCTGGTGGTA CGACCTGGTC 540 

ATCTGCCTGG ACTTCTGGGA CAGCGAGGAG CTGTCGCTGA GGATGCTGGA GGTCCTGGGG 600 

GGCTTCCTGC CTTTCCTCCT GCTGCTCGTC TGCCACGTGC TCACCCAGGC CACAGCCTGT 660 

CGCACCTGCC ACCGCCAACA GCAGCCCGCA GCCTGCCGGG GCTTCGCCCG TGTGGCCAGG 720 

ACCATTCTGT CAGCCTATGT GGTCCTGAGG CTGCCCTACC AGCTGGCCCA GCTGCTCTAC 780 

10 CTGGCCTTCC TGTGGGACGT CTACTCTGGC TACCTGCTCT GGGAGGCCCT GGTCTACTCC 840 

GACTACCTGA TCCTACTCAA CAGCTGCCTC AGCCCCTTCC TCTGCCTCAT GGCCAGTGCC 900 

GACCTCCGGA CCCTGCTGCG CTCCGTGCTC TCGTCCTTCG CGGCAGCTCT CTGCGAGGAG 960 

CGGCCGGGCA GCTTCACGCC CACTGAGCCA CAGACCCAGC TAGATTCTGA GGGTCCAACT 1020 

CTGCCAGAGC CGATGGCAGA GGCCCAGTCA CAGATGGATC CTGTGGCCCA GCCTCAGGTG 1080 

15 AACCCCACAC TCCAGCCACG ATCGGATCCC ACAGCTCAGC CACAGCTGAA CCCTACGGCC 1140 

CAGCCACAGT CGGATCCCAC AGCCCAGCCA CAGCTGAACC TCATGGCCCA GCCACAGTCA 1200 

GATTCTGTGG CCCAGCCACA GGCAGACACT AACGTCCAGA CCCCTGCACC TGCTGCCAGT 1260 

TCTGTGCCCA GTCCCTGTGA TGAAGCTTCC CCAACCCCAT CCTCGCATCC TACCCCAGGG 1320 

GCCCTTGAGG ACCCAGCCAC ACCTCCTGCC TCTGAAGGAG AAAGCCCCAG CAGCACCCCG 13 80 

20 CCAGAGGCGG CCCCGGGCGC AGGCCCCACG TGA 1413 
(11) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 468 amino acids 

(B) TYPE: amino acid 
25 (C) STRANDEDNESS : 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Asp Thr Thr Met Glu Ala Asp Leu Gly Ala Thr Gly His Arg Pro 
30 1 5 10 15 
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Arg Thr Glu Leu Asp Asp Glu Asp Ser Tyr Pro Gin Gly Gly Trp Asp 
20 25 30 

Thr Val Phe Leu Val Ala Leu Leu Leu Leu Gly Leu Pro Ala Asn Gly 
35 40 45 

5 Leu Met Ala Trp Leu Ala Gly Ser Gin Ala Arg His Gly Ala Gly Thr 

50 55 60 

Arg Leu Ala Leu Leu Leu Leu Ser Leu Ala Leu Ser Asp Phe Leu Phe 
65 70 75 80 

Leu Ala Ala Ala Ala Phe Gin lie Leu Glu lie Arg His Gly Gly His 
10 85 90 95 

Trp Pro Leu Gly Thr Ala Ala Cys Arg Phe Tyr Tyr Phe Leu Trp Gly 
100 105 110 

Val Ser Tyr Ser Ser Gly Leu Phe Leu Leu Ala Ala Leu Ser Leu Asp 
115 120 125 

15 Arg Cys Leu Leu Ala Leu Cys Pro His Trp Tyr Pro Gly His Arg Pro 

130 135 140 

Iff Val Arg Leu Pro Leu Trp Val Cys Ala Gly Val Trp Val Leu Ala Thr 

ly 145 150 155 160 

= q Leu Phe Ser Val Pro Trp Leu Val Phe Pro Glu Ala Ala Val Trp Trp 

m 20 165 170 175 



Tyr Asp Leu Val lie Cys Leu Asp Phe Trp Asp Ser Glu Glu Leu Ser 
180 185 190 

Leu Arg Met Leu Glu Val Leu Gly Gly Phe Leu Pro Phe Leu Leu Leu 
195 200 205 

25 Leu Val Cys His Val Leu Thr Gin Ala Thr Arg Thr Cys His Arg Gin 

210 215 220 

Gin Gin Pro Ala Ala Cys Arg Gly Phe Ala Arg Val Ala Arg Thr lie 
225 230 235 240 

Leu Ser Ala Tyr Val Val Leu Arg Leu Pro Tyr Gin Leu Ala Gin Leu 
30 245 250 255 

Leu Tyr Leu Ala Phe Leu Trp Asp Val Tyr Ser Gly Tyr Leu Leu Trp 
260 265 270 

Glu Ala Leu Val Tyr Ser Asp Tyr Leu lie Leu Leu Asn Ser Cys Leu 
275 280 285 

35 Ser Pro Phe Leu Cys Leu Met Ala Ser Ala Asp Leu Arg Thr Leu Leu 

290 295 300 



Arg Ser Val Leu Ser Ser Phe Ala Ala Ala Leu Cys Glu Glu Arg Pro 



AREN-0054 -75- PATENT 

310 315 320 

Gly Ser Phe Thr Pro Thr Glu Pro Gin Thr Gin Leu Asp Ser Glu Gly 
325 330 335 

Pro Thr Leu Pro Glu Pro Met Ala Glu Ala Gin Ser Gin Met Asp Pro 
^ 340 345 350 

Val Ala Gin Pro Gin Val Asn Pro Thr Leu Gin Pro Arg Ser Asp Pro 
355 360 365 

Thr Ala Gin Pro Gin Leu Asn Pro Thr Ala Gin Pro Gin Ser Asp Pro 
370 375 380 

10 Thr Ala Gin Pro Gin Leu Asn Leu Met Ala Gin Pro Gin Ser Asp Ser 

385 390 395 400 

Val Ala Gin Pro Gin Ala Asp Thr Asn Val Gin Thr Pro Ala Pro Ala 
405 410 415 

Ala Ser Ser Val Pro Ser Pro Cys Asp Glu Ala Ser Pro Thr Pro Ser 



15 420 425 430 

ly Ser His Pro Thr Pro Gly Ala Leu Glu Asp Pro Ala Thr Pro Pro Ala 

l|1 435 440 445 

III 

, Ser Glu Gly Glu Ser Pro Ser Ser Thr Pro Pro Glu Ala Ala Pro Gly 

13 450 455 460 



20 Ala Gly Pro Thr 

465 

(12) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1248 base pairs 
25 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

30 ATGTCAGGGA TGGAAAAACT TCAGAATGCT TCCTGGATCT ACCAGCAGAA ACTAGAAGAT 60 

CCATTCCAGA AACACCTGAA CAGCACCGAG GAGTATCTGG CCTTCCTCTG CGGACCTCGG 120 

CGCAGCCACT TCTTCCTCCC CGTGTCTGTG GTGTATGTGC CAATTTTTGT GGTGGGGGTC 180 

ATTGGCAATG TCCTGGTGTG CCTGGTGATT CTGCAGCACC AGGCTATGAA GACGCCCACC 240 

AACTACTACC TCTTCAGCCT GGCGGTCTCT GACCTCCTGG TCCTGCTCCT TGGAATGCCC 3 00 
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CTGGAGGTCT ATGAGATGTG GCGCAACTAC CCTTTCTTGT TCGGGCCCGT GGGCTGCTAC 3 60 

TTCAAGACGG CCCTCTTTGA GACCGTGTGC TTCGCCTCCA TCCTCAGCAT CACCACCGTC 42 0 

AGCGTGGAGC GCTACGTGGC CATCCTACAC CCGTTCCGCG CCAAACTGCA GAGCACCCGG 4 80 

CGCCGGGCCC TCAGGATCCT CGGCATCGTC TGGGGCTTCT CCGTGCTCTT CTCCCTGCCC 540 

5 AACACCAGCA TCCATGGCAT CAAGTTCCAC TACTTCCCCA ATGGGTCCCT GGTCCCAGGT 600 

TCGGCCACCT GTACGGTCAT CAAGCCCATG TGGATCTACA ATTTCATCAT CCAGGTCACC 660 

TCCTTCCTAT TCTACCTCCT CCCCATGACT GTCATCAGTG TCCTCTACTA CCTCATGGCA 720 

CTCAGACTAA AGAAAGACAA ATCTCTTGAG GCAGATGAAG GGAATGCAAA TATTCAAAGA 780 

CCCTGCAGAA AATCAGTCAA CAAGATGCTG TTTGTCTTGG TCTTAGTGTT TGCTATCTGT 84 0 

10 TGGGCCCCGT TCCACATTGA CCGACTCTTC TTCAGCTTTG TGGAGGAGTG GAGTGAATCC 900 

CTGGCTGCTG TGTTCAACCT CGTCCATGTG GTGTCAGGTG TCTTCTTCTA CCTGAGCTCA 9,60 

GCTGTCAACG CCATTATCTA TAACCTACTG TCTCGCCGCT TCCAGGCAGC ATTCCAGAAT 102 0 

GTGATCTCTT CTTTCCACAA ACAGTGGCAC TCCCAGCATG ACCCACAGTT GCCACCTGCC 10 8 0 

CAGCGGAACA TCTTCCTGAC AGAATGCCAC TTTGTGGAGC TGACCGAAGA TATAGGTCCC 114 0 

15 CAATTCCCAT GTCAGTCATC CATGCACAAC TCTCACCTCC CAACAGCCCT CTCTAGTGAA 12 0 0 

CAGATGTCAA GAACAAACTA TCAAAGCTTC CACTTTAACA AAACCTGA 124 8 

(13) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 415 amino acids 
20 (B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 12 : 

25 Met Ser Gly Met Glu Lys Leu Gin Asn Ala Ser Trp lie Tyr Gin Gin 

15 10 15 

Lys Leu Glu Asp Pro Phe Gin Lys His Leu Asn Ser Thr Glu Glu Tyr 

20 25 30 . 

Leu Ala Phe Leu Cys Gly Pro Arg Arg Ser His Phe Phe Leu Pro Val 
30 35 40 45 

Ser Val Val Tyr Val Pro lie Phe Val Val Gly Val lie Gly Asn Val 
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50 55 60 

Leu Val Cys Leu Val He Leu Gin His Gin Ala Met Lys Thr Pro Thr 
^5 70 75 80 

Asn Tyr Tyr Leu Phe Ser Leu Ala Val Ser Asp Leu Leu Val Leu Leu 
5 85 90 95 

Leu Gly Met Pro Leu Glu Val Tyr Glu Met Trp Arg Asn Tyr Pro Phe 
100 105 110 

Leu Phe Gly Pro Val Gly Cys Tyr Phe Lys Thr Ala Leu Phe Glu Thr 
115 120 125 

10 Val Cys Phe Ala Ser He Leu Ser He Thr Thr Val Ser Val Glu Arg 

130 135 140 

Tyr Val Ala He Leu His Pro Phe Arg Ala Lys Leu Gin Ser Thr Arg 
145 150 155 160 

^9 Arg Ala Leu Arg He Leu Gly He Val Trp Gly Phe Ser Val Leu 
^fI 1^ 165 170 175 

ii^ I 

!p Ser Leu Pro Asn Thr Ser He His Gly He Lys Phe His Tyr Phe 

'""^ 180 185 190 



20 



Pro Asn Gly Ser Leu Val Pro Gly Ser Ala Thr Cys Thr Val He Lys 
195 200 205 

Pro Met Trp He Tyr Asn Phe He He Gin Val Thr Ser Phe Leu Phe 
210 215 220 

Tyr Leu Leu Pro. Met Thr Val He Ser Val Leu Tyr Tyr Leu Met Ala 
225 230 235 240 

Leu Arg Leu Lys Lys Asp Lys Ser Leu Glu Ala Asp Glu Gly Asn Ala 
25 245 250 255 

Asn He Gin Arg Pro Cys Arg Lys Ser Val Asn Lys Met Leu Phe Val 
260 265 270 

Leu Val Leu Val Phe Ala He Cys Trp Ala Pro Phe His He Asp Arg 
275 280 285 

30 Leu Phe Phe Ser Phe Val Glu Glu Trp Ser Glu Ser Leu Ala Ala Val 

290 295 300 

Phe Asn Leu Val His Val Val Ser Gly Val Phe Phe Tyr Leu Ser Ser 
305 310 315 320 

Ala Val Asn Pro He He Tyr Asn Leu Leu Ser Arg Arg Phe Gin Ala 
35 325 330 335 



Ala Phe Gin Asn Val He Ser Ser Phe His Lys Gin Trp His Ser Gin 
340 345 350 
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His ASP Pro Gin Leu Pro Pro Ala Gin Arg Asn He Phe Leu Thr Glu 

360 

Cys His Phe val Glu Leu Thr Glu Asp He Gly Pro Gin Phe Pro Cys 

5 Gin Ser Ser Met His Asn Ser His Leu Pro Thr Ala Leu Ser Ser Glu 

390 395 



400 

Gin Met Ser Arg Thr Asn Tyr Gin Ser Phe His Phe Asn Lys Thr 
405 410 415 

(14) INFORMATION FOR SEQ ID NO: 13: 



10 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1173 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECXILE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
ATGCCAGATA CTAATAGCAC AATCAATTTA TCACTAAGCA CTCGTGTTAC TTTAGCATTT 
TTTATGTCCT TAGTAGCTTT TGCTATAATG CTAGGAAATG CTTTGGTCAT TTTAGCTTTT 
GTGGTGGACA AAAACCTTAG ACATCGAAGT AGTTATTTTT TTCTTAACTT GGCCATCTCT 
20 GACTTCTTTG TGGGTGTGAT CTCCATTCCT TTGTACATCC CTCACACGCT GTTCGAATGG 
GATTTTGGAA AGGAAATCTG TGTATTTTGG CTCACTACTG ACTATCTGTT ATGTACAGCA 
TCTGTATATA ACATTGTCCT CATCAGCTAT GATCGATACC TGTCAGTCTC AAATGCTGTG 
TCTTATAGAA CTCAACATAC TGGGGTCTTG AAGATTGTTA CTCTGATGGT GGCCGTTTGG 
GTGCTGGCCT TCTTAGTGAA TGGGCCAATG ATTCTAGTTT CAGAGTCTTG GAAGGATGAA 
25 GGTAGTGAAT GTGAACCTGG ATTTTTTTCG GAATGGTACA TCCTTGCCAT CACATCATTC 
TTGGAATTCG TGATCCCAGT CATCTTAGTC GCTTATTTCA ACATGAATAT TTATTGGAGC 
CTGTGGAAGC GTGATCATCT CAGTAGGTGC CAAAGCCATC CTGGACTGAC TGCTGTCTCT 
TCCAACATCT GTGGACACTC ATTCAGAGGT AGACTATCTT CAAGGAGATC TCTTTCTGCA 
TCGACAGAAG TTCCTGCATC CTTTCATTCA GAGAGACAGA GGAGAAAGAG TAGTCTCATG 
30 TTTTCCTCAA GAACCAAGAT GAATAGCAAT ACAATTGCTT CCAAAATGGG TTCCTTCTCC 
CAATCAGATT CTGTAGCTCT TCACCAAAGG GAACATGTTG AACTGCTTAG AGCCAGGAGA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
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TTAGCCAAGT CACTGGCCAT TCTCTTAGGG GTTTTTGCTG TTTGCTGGGC TCCATATTCT 96 0 

CTGTTCACAA TTGTCCTTTC ATTTTATTCC TCAGCAACAG GTCCTAAATC AGTTTGGTAT 102 0 

AGAATTGCAT TTTGGCTTCA GTGGTTCAAT TCCTTTGTCA ATCCTCTTTT GTATCCATTG 1080 

TGTCACAAGC GCTTTCAAAA GGCTTTCTTG AAAATATTTT GTATAAAAAA GCAACCTCTA 114 0 

5 CCATCACAAC ACAGTCGGTC AGTATCTTCT TAA 1-^73 
(15) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 90 amino acids 

(B) TYPE: amino acid 
10 (C) STRANDEDNESS : 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 14 : 

Met Pro Asp Thr Asn Ser Thr He Asn Leu Ser Leu Ser Thr Arg Val 
15 10 15 

Thr Leu Ala Phe Phe Met Ser Leu Val Ala Phe Ala He Met Leu Gly 
20 25 30 

Asn Ala Leu Val He Leu Ala Phe Val Val Asp Lys Asn Leu Arg His 
35 40 45 

Arg Ser Ser Tyr Phe Phe Leu Asn Leu Ala He Ser Asp Phe Phe Val 
50 55 60 

Gly Val He Ser He Pro Leu Tyr He Pro His Thr Leu Phe Glu Trp 
^5 70 75 80 

Asp Phe Gly Lys Glu He Cys Val Phe Trp Leu Thr Thr Asp Tyr Leu 
25 85 90 95 

Leu Cys Thr Ala Ser Val Tyr Asn He Val Leu He Ser Tyr Asp Arg 
100 105 110 

Tyr Leu Ser Val Ser Asn Ala Val Ser Tyr Arg Thr Gin His Thr Gly 
115 120 125 

30 Val Leu Lys He Val Thr Leu Met Val Ala Val Trp Val Leu Ala Phe 

130 135 140 

Leu Val Asn Gly Pro Met He Leu Val Ser Glu Ser Trp Lys Asp Glu 
145 150 155 160 

Gly Ser Glu Cys Glu Pro Gly Phe Phe Ser Glu Trp Tyr He Leu Ala 
35 165 170 175 



20 



AREN-0054 . 80 



PATENT 



He Thr Ser Phe Leu Glu Phe Val He Pro Val He Leu Val Ala Tyr 
180 185 190 

Phe Asn Met Asn He Tyr Trp Ser Leu Trp Lys Arg Asp His Leu Ser 
195 200 205 

5 Arg Cys Gin Ser His Pro Gly Leu Thr Ala Val Ser Ser Asn He Cys 

210 215 220 

Gly His Ser Phe Arg Gly Arg Leu Ser Ser Arg Arg Ser Leu Ser Ala 
225 230 235 240 

Ser Thr Glu Val Pro Ala Ser Phe His Ser Glu Arg Gin Arg Arg Lys 
245 250 255 

Ser Ser Leu Met Phe Ser Ser Arg Thr Lys Met Asn Ser Asn Thr He 
260 265 270 

Ala Ser Lys Met Gly Ser Phe Ser Gin Ser Asp Ser Val Ala Leu His 
275 280 285 



15 Gin Arg Glu His Val Glu Leu Leu Arg Ala Arg Arg Leu Ala Lys Ser 

IP 290 295 300 

m 

Iff Leu Ala He Leu Leu Gly Val Phe Ala Val Cys Trp Ala Pro Tyr Ser 

iy 305 310 315 320 

O Leu Pile Thr He Val Leu Ser Phe Tyr Ser Ser Ala Thr Gly Pro Lys 

m 325 330 335 



Ser Val Trp Tyr Arg He Ala Phe Trp Leu Gin Trp Phe Asn Ser Phe 
340 345 350 

Val Asn Pro Leu Leu Tyr Pro Leu Cys His Lys Arg Phe Gin Lys Ala 
355 360 365 

25 Phe Leu Lys He Phe Cys He Lys Lys Gin Pro Leu Pro Ser Gin His 

370 375 380 

Ser Arg Ser Val Ser Ser 
385 390 

(16) INFORMATION FOR SEQ ID N0:15: 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

35 (ii) MOLECULE TYPE: DNA (genomic) 

(iv) ANTI- SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
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GGAAAGCTTA ACGATCCCCA GGAGCAACAT 30 

(17) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 
5 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iv) ANTI- SENSE: YES 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

CTGGGATCCT ACGAGAGCAT TTTTCACACA G 
31 

(18) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 1128 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

ATGGCGAACG CGAGCGAGCC GGGTGGCAGC GGCGGCGGCG AGGCGGCCGC CCTGGGCCTC 60 

AAGCTGGCCA CGCTCAGCCT GCTGCTGTGC GTGAGCCTAG CGGGCAACGT GCTGTTCGCG 12 0 

CTGCTGATCG TGCGGGAGCG CAGCCTGCAC CGCGCCCCGT ACTACCTGCT GCTCGACCTG 18 0 

TGCCTGGCCG ACGGGCTGCG CGCGCTCGCC TGCCTCCCGG CCGTCATGCT GGCGGCGCGG 24 0 

25 CGTGCGGCGG CCGCGGCGGG GGCGCCGCCG GGCGCGCTGG GCTGCAAGCT GCTCGCCTTC 3 00 

CTGGCCGCGC TCTTCTGCTT CCACGCCGCC TTCCTGCTGC TGGGCGTGGG CGTCACCCGC 3 60 

TACCTGGCCA TCGCGCACCA CCGCTTCTAT GCAGAGCGCC TGGCCGGCTG GCCGTGCGCC 420 

GCCATGCTGG TGTGCGCCGC CTGGGCGCTG GCGCTGGCCG CGGCCTTCCC GCCAGTGCTG 480 

GACGGCGGTG GCGACGACGA GGACGCGCCG TGCGCCCTGG AGCAGCGGCC CGACGGCGCC 540 

30 CCCGGCGCGC TGGGCTTCCT GCTGCTGCTG GCCGTGGTGG TGGGCGCCAC GCACCTCGTC 600 

TACCTCCGCC TGCTCTTCTT CATCCACGAC CGCCGCAAGA TGCGGCCCGC GCGCCTGGTG 660 



AREN-0054 - 82 - PATENT 

CCCGCCGTCA GCCACGACTG GACCTTCCAC GGCCCGGGCG CCACCGGCCA GGCGGCCGCC 72 0 

AACTGGACGG CGGGCTTCGG CCGCGGGCCC ACGCCGCCCG CGCTTGTGGG CATCCGGCCC 78 0 

GCAGGGCCGG GCCGCGGCGC GCGCCGCCTC CTCGTGCTGG AAGAATTCAA GACGGAGAAG 84 0 

• AGGCTGTGCA AGATGTTCTA CGCCGTCACG CTGCTCTTCC TGCTCCTCTG GGGGCCCTAC 90 0 

5 GTCGTGGCCA GCTACCTGCG GGTCCTGGTG CGGCCCGGCG CCGTCCCCCA GGCCTACCTG 96 0 

ACGGCCTCCG TGTGGCTGAC CTTCGCGCAG GCCGGCATCA ACCCCGTCGT GTGCTTCCTC 102 0 

TTCAACAGGG AGCTGAGGGA CTGCTTCAGG GCCCAGTTCC CCTGCTGCCA GAGCCCCCGG 108 0 

ACCACCCAGG CGACCCATCC CTGCGACCTG AAAGGCATTG GTTTATGA 112 8 

(19) INFORMATION FOR SEQ ID NO: 18: 

O 10 (i) SEQUENCE CHARACTERISTICS: 

^0 (A) LENGTH: 375 amino acids 

ifi (B) TYPE: amino acid 

'4 (C) STRANDEDNESS : 

ijl (D) TOPOLOGY: not relevant 

a? I 

m 15 (ii) MOLECULE TYPE: protein 

iii 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Met Ala Asn Ala Ser Glu Pro Gly Gly Ser Gly Gly Gly Glu Ala Ala 
15 10 15 

Ala Leu Gly Leu Lys Leu Ala Thr Leu Ser Leu Leu Leu Cys Val Ser 
20 20 25 3 0 

Leu Ala Gly Asn Val Leu Phe Ala Leu Leu He Val Arg Glu Arg Ser 
35 40 45 

Leu His Arg Ala Pro Tyr Tyr Leu Leu Leu Asp Leu Cys Leu Ala Asp 
50 55 60 

25 Gly Leu Arg Ala Leu Ala Cys Leu Pro Ala Val Met Leu Ala Ala Arg 

65 70 75 80 

Arg Ala Ala Ala Ala Ala Gly Ala Pro Pro Gly Ala Leu Gly Cys Lys 
85 90 95 

Leu Leu Ala Phe Leu Ala Ala Leu Phe Cys Phe His Ala Ala Phe Leu 
30 100 105 110 

Leu Leu Gly Val Gly Val Thr Arg Tyr Leu Ala He Ala His His Arg 
115 120 125 

Phe Tyr Ala Glu Arg Leu Ala Gly Trp Pro Cys Ala Ala Met Leu Val 
130 135 140 



MS 
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Cys Ala Ala Trp Ma Leu Ala Leu Ala Ala Ala Phe Pro Pro Val Leu 
145 "5 

.sp Gly Gly Gly Asp Asp Olu Asp Ala Pro Cys Ala Leu Olu Oln Arg 



165 I'^O 



pro ASP Gly Ala Pro Gly Ala Leu Gly Phe Leu Leu Leu Leu Ala Val 

180 

val val Gly Ala Thr His Leu Val Tyr Leu Arg Leu Leu Phe Phe He 

195 

His ASP Arg Arg Lys Met Arg Pro Ala Arg Leu Val Pro Ala Val Ser 

210 

T'V,^ Ph^ His Gly Pro Gly Ala Thr Gly Gin Ala Ala Ala 
His Asp Trp Thr Phe His ^xy rtu ^ j 

225 

Trp thr Al. .ly Phe OXy Arg Gly Pro Thr Pro Pro Ala Leu v.l 

245 

Gly lie Ar^ Pro Ala oly Pro Oly Ar| Gly Ala Arg Arg L,u 1.U Val 

260 265 
.eu Glu Glu Phe Lys Thr Glu Lys Arg Leu Cys Lys Met Phe Tyr Ala 
275 280 

riv Pro Tvr Val Val Ala Ser 
val Thr Leu Leu Phe Leu Leu Leu Trp Gly Pro Tyr 

290 295 
^ r,eu Arg val 1^. Val Arg Pro Gly Ala val Pro Gin Ala Tyr 

310 

Z Ala sar val Trp Leu Thr Phe Ala Gl„ Ala Gly He A,„ Pro v.l 



32S 



val cy. Ph. t.eu Phe A,« Arg Glu Leu Arg ASP Cya Ph. Arg Ala Gl„ 

340 

A^r, TV.T- Thr Gin Ala Thr His Pro Cys 
Phe pro cys Cys Gin Ser Pro Arg Thr Thr Gin 



355 

^ Asp Leu Lys Gly He Gly Leu 
30 370 

(2 0) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1002 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 



ATGAACACCA 


CAGTGATGCA 


AGGCTTCAAC 


AGATCTGAGC 


GGTGCCCCAG 


AGACACTCGG 


60 


ATAGTACAGC 


TGGTATTCCC 


AGCCCTCTAC 


ACAGTGGTTT 


TCTTGACCGG 


CATCCTGCTG 


120 


AATACTTTGG 


CTCTGTGGGT 


GTTTGTTCAC 


ATCCCCAGCT 


CCTCCACCTT 


CATCATCTAC 


180 


CTCAAAAACA 


CTTTGGTGGC 


CGACTTGATA 


ATGACACTCA 


TGCTTCCTTT 


CAAAATCCTC 


240 


TCTGACTCAC 


ACCTGGCACC 


CTGGCAGCTC 


AGAGCTTTTG 


TGTGTCGTTT 


TTCTTCGGTG 


300 


ATATTTTATG 


AGACCATGTA 


TGTGGGCATC 


GTGCTGTTAG 


GGCTCATAGC 


CTTTGACAGA 


360 


TTCCTCAAGA 


TCATCAGACC 


TTTGAGAAAT 


ATTTTTCTAA 


AAAAACCTGT 


TTTTGCAAAA 


420 


ACGGTCTCAA 


TCTTCATCTG 


GTTCTTTTTG 


TTCTTCATCT 


CCCTGCCAAA 


TACGATCTTG 


480 


AGCAACAAGG 


AAGCAACACC 


ATCGTCTGTG 


AAAAAGTGTG 


CTTCCTTAAA 


GGGGCCTCTG 


540 


GGGCTGAAAT 


GGCATCAAAT 


GGTAAATAAC 


ATATGCCAGT 


TTATTTTCTG 


GACTGTTTTT 


600 


ATCCTAATGC 


TTGTGTTTTA 


TGTGGTTATT 


GCAAAAAAAG 


TATATGATTC 


TTATAGAAAG 


*660 


TCCAAAAGTA 


AGGACAGAAA 


AAACAACAAA 


AAGCTGGAAG 


GCAAAGTATT 


TGTTGTCGTG 


720 


GCTGTCTTCT 


TTGTGTGTTT 


TGCTCCATTT 


CATTTTGCCA 


GAGTTCCATA 


TACTCACAGT 


780 


CAAACCAACA 


ATAAGACTGA 


CTGTAGACTG 


CAAAATCAAC 


TGTTTATTGC 


TAAAGAAACA 


840 


ACTCTCTTTT 


TGGCAGCAAC 


TAACATTTGT 


ATGGATCCCT 


TAATATACAT 


ATTCTTATGT 


900 


AAAAAATTCA 


CAGAAAAGCT 


ACCATGTATG 


CAAGGGAGAA 


AGACCACAGC 


ATCAAGCCAA 


960 


GAAAATCATA 


GCAGTCAGAC 


AGACAACATA 


ACCTTAGGCT 


GA 




1002 



(21) INFORMATION FOR SEQ ID NO: 20: 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 3 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNES S : 

(D) TOPOLOGY: not relevant 

25 (ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Met Asn Thr Thr Val Met Gin Gly Phe Asn Arg Ser Glu Arg Cys Pro 
1 5 10 15 

Arg Asp Thr Arg He Val Gin Leu Val Phe Pro Ala Leu Tyr Thr Val 
30 20 25 30 



m 



3 5*!! 
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Val Phe Leu Thr Gly lie Leu Leu Asn Thr Leu Ala Leu Trp Val Phe 
35 40 45 

Val His lie Pro Ser Ser Ser Thr Phe lie lie Tyr Leu Lys Asn Thr 
50 55 60 

5 Leu Val Ala Asp Leu lie Met Thr Leu Met Leu Pro Phe Lys lie Leu 

65 70 75 80 

Ser Asp Ser His Leu Ala Pro Trp Gin Leu Arg Ala Phe Val Cys Arg 
85 90 95 

Phe Ser Ser Val lie Phe Tyr Glu Thr Met Tyr Val Gly He Val Leu 
10 100 105 110 

Leu Gly Leu He Ala Phe Asp Arg Phe Leu Lys He He Arg Pro Leu 
115 120 125 

Arg Asn He Phe Leu Lys Lys Pro Val Phe Ala Lys Thr Val Ser He 
130 135 140 

15 Phe He Trp Phe Phe Leu Phe Phe He Ser Leu Pro Asn Thr He Leu 



Fi 145 150 155 160 



Ser Asn Lys Glu Ala Thr Pro Ser Ser Val Lys Lys Cys Ala Ser Leu 
165 170 175 

Lys Gly Pro Leu Gly Leu Lys Trp His Gin Met Val Asn Asn He Cys 
20 180 185 190 

Gin Phe He Phe Trp Thr Val Phe He Leu Met Leu Val Phe Tyr Val 
195 200 205 

Val He Ala Lys Lys Val Tyr Asp Ser Tyr Arg Lys Ser Lys Ser Lys 
210 215 220 

25 Asp Arg Lys Asn Asn Lys Lys Leu Glu Gly Lys Val Phe Val Val Val 

225 230 235 240 

Ala Val Phe Phe Val Cys Phe Ala Pro Phe His Phe Ala Arg Val Pro 
245 250 255 

Tyr Thr His Ser Gin Thr Asn Asn Lys Thr Asp Cys Arg Leu Gin Asn 
30 260 265 270 

Gin Leu Phe He Ala Lys Glu Thr Thr Leu Phe Leu Ala Ala Thr Asn 
275 280 285 

He Cys Met Asp Pro Leu He Tyr He Phe Leu Cys Lys Lys Phe Thr 
290 295 300 

35 Glu Lys Leu Pro Cys Met Gin Gly Arg Lys Thr Thr Ala Ser Ser Gin 

305 310 315 320 

Glu Asn His Ser Ser Gin Thr Asp Asn He Thr Leu Gly 
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(22) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1122 base pairs 
5 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

10 ATGGCCAACA CTACCGGAGA GCCTGAGGAG GTGAGCGGCG CTCTGTCCCC ACCGTCCGCA 60 

TCAGCTTATG TGAAGCTGGT ACTGCTGGGA CTGATTATGT GCGTGAGCCT GGCGGGTAAC 120 

GCCATCTTGT CCCTGCTGGT GCTCAAGGAG CGTGCCCTGC ACAAGGCTCC TTACTACTTC 180 

CTGCTGGACC TGTGCCTGGC CGATGGCATA CGCTCTGCCG TCTGCTTCCC CTTTGTGCTG ,240 

GCTTCTGTGC GCCACGGCTC TTCATGGACC TTCAGTGCAC TCAGCTGCAA GATTGTGGCC 3 00 

15 TTTATGGCCG TGCTCTTTTG CTTCCATGCG GCCTTCATGC TGTTCTGCAT CAGCGTCACC 3 60 

CGCTACATGG CCATCGCCCA CCACCGCTTC TACGCCAAGC GCATGACACT CTGGACATGC 420 

GCGGCTGTCA TCTGCATGGC CTGGACCCTG TCTGTGGCCA TGGCCTTCCC ACCTGTCTTT 4 80 

GACGTGGGCA CCTACAAGTT TATTCGGGAG GAGGACCAGT GCATCTTTGA GCATCGCTAC 540 

TTCAAGGCCA ATGACACGCT (SGGCTTCATG CTTATGTTGG CTGTGCTCAT GGCAGCTACC 600 

20 CATGCTGTCT ACGGCAAGCT GCTCCTCTTC GAGTATCGTC ACCGCAAGAT GAAGCCAGTG 660 

CAGATGGTGC CAGCCATCAG CCAGAACTGG ACATTCCATG GTCCCGGGGC CACCGGCCAG 720 

GCTGCTGCCA ACTGGATCGC CGGCTTTGGC CGTGGGCCCA TGCCACCAAC CCTGCTGGGT 780 

ATCCGGCAGA ATGGGCATGC AGCCAGCCGG CGGCTACTGG GCATGGACGA GGTCAAGGGT 84 0 

GAAAAGCAGC TGGGCCGCAT GTTCTACGCG ATCACACTGC TCTTTCTGCT CCTCTGGTCA 900 

25 CCCTACATCG TGGCCTGCTA CTGGCGAGTG TTTGTGAAAG CCTGTGCTGT GCCCCACCGC 960 

TACCTGGCCA CTGCTGTTTG GATGAGCTTC GCCCAGGCTG CCGTCAACCC AATTGTCTGC 102 0 

TTCCTGCTCA ACAAGGACCT CAAGAAGTGC CTGACCACTC ACGCCCCCTG CTGGGGCACA 1080 

GGAGGTGCCC CGGCTCCCAG AGAACCCTAC TGTGTCATGT GA 1122 



(23) INFORMATION FOR SEQ ID NO:22: 
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PATENT 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 73 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: DNA (genomic) 



10 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

Met Ala Asn Thr Thr Gly Glu Pro Glu Glu Val Ser Gly Ala Leu Ser 
15 10 15 

Pro Pro Ser Ala Ser Ala Tyr Val Lys Leu Val Leu Leu Gly Leu He 
20 25 30 

Met Cys Val Ser Leu Ala Gly Asn Ala He Leu Ser Leu Leu Val Leu 
35 40 45 

Lys Glu Arg Ala Leu His Lys Ala Pro Tyr Tyr Phe Leu Leu Asp Leu 
15 50 55 60 

Cys Leu Ala Asp Gly He Arg Ser Ala Val Cys Phe Pro Phe Val Leu 
^5 70 75 80 

Ala Ser Val Arg His Gly Ser Ser Trp Thr Phe Ser Ala Leu Ser Cys 
85 90 95 

20 Lys He Val Ala Phe Met Ala Val Leu Phe Cys Phe His Ala Ala Phe 

100 105 110 

Met Leu Phe Cys He Ser Val Thr Arg Tyr Met Ala He Ala His His 
115 120 125 

Arg Phe Tyr Ala Lys Arg Met Thr Leu Trp Thr Cys Ala Ala Val He 
25 130 135 140 

Cys Met Ala Trp Thr Leu Ser Val Ala Met Ala Phe Pro Pro Val Phe 
145 150 155 160 

Asp Val Gly Thr Tyr Lys Phe He Arg Glu Glu Asp Gin Cys He Phe 
165 170 175 

30 Glu His Arg Tyr Phe Lys Ala Asn Asp Thr Leu Gly Phe Met Leu Met 

180 185 190 

Leu Ala Val Leu Met Ala Ala Thr His Ala Val Tyr Gly Lys Leu Leu 
195 200 205 

Leu Phe Glu Tyr Arg His Arg Lys Met Lys Pro Val Gin Met Val Pro 
35 210 215 220 



Ala He Ser Gin Asn Trp Thr Phe His Gly Pro Gly Ala Thr Gly Gin 
225 230 235 240 
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Ala Ala Ala Asn Trp He Ala Gly Phe Gly Arg Gly Pro Met Pro Pro 
245 250 255 

Thr Leu Leu Gly He Arg Gin Asn Gly His Ala Ala Ser Arg Arg Leu 
260 265 270 

5 Leu Gly Met Asp Glu Val Lys Gly Glu Lys Gin Leu Gly Arg Met Phe 

280 285 

Tyr Ala He Thr Leu Leu Phe Leu Leu Leu Trp Ser Pro Tyr He Val 
290 295 300 

Ala Cys Tyr Trp Arg Val Phe Val Lys Ala Cys Ala Val Pro His Arq 
305 310 315 320 

Tyr Leu Ala Thr Ala Val Trp Met Ser Phe Ala Gin Ala Ala Val Asn 
325 330 335 

Pro He Val Cys Phe Leu Leu Asn Lys Asp Leu Lys Lys Cys Leu Thr 
340 345 350 



15 



Thr His Ala Pro Cys Trp Gly Thr Gly Gly Ala Pro Ala Pro Arg Glu 
355 360 365 



Pro Tyr Cys Val Met 
370 

(24) INFORMATION FOR SEQ ID NO:23: 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1053 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 

ATGGCTTTGG AACAGAACCA GTCAACAGAT TATTATTATG AGGAAAATGA AATGAATGGC 60 

ACTTATGACT ACAGTCAATA TGAATTGATC TGTATCAAAG AAGATGTCAG AGAATTTGCA 12 0 

AAAGTTTTCC TCCCTGTATT CCTCACAATA GCTTTCGTCA TTGGACTTGC AGGCAATTCC 18 0 

30 ATGGTAGTGG CAATTTATGC CTATTACAAG AAACAGAGAA CCAAAACAGA TGTGTACATC 24 0 

CTGAATTTGG CTGTAGCAGA TTTACTCCTT CTATTCACTC TGCCTTTTTG GGCTGTTAAT 3 00 

GCAGTTCATG GGTGGGTTTT AGGGAAAATA ATGTGCAAAA TAACTTCAGC CTTGTACACA 360 

CTAAACTTTG TCTCTGGAAT GCAGTTTCTG GCTTGCATCA GCATAGACAG ATATGTGGCA 42 0 

GTAACTAATG TCCCCAGCCA ATCAGGAGTG GGAAAACCAT GCTGGATCAT CTGTTTCTGT 4 80 
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GTCTGGATGG CTGCCATCTT GCTGAGCATA CCCCAGCTGG TTTTTTATAC AGTAAATGAC 54 0 

AATGCTAGGT GCATTCCCAT TTTCCCCCGC TACCTAGGAA CATCAATGAA AGCATTGATT 600 

CAAATGCTAG AGATCTGCAT TGGATTTGTA GTACCCTTTC TTATTATGGG GGTGTGCTAC 660 

TTTATCACGG CAAGGACACT CATGAAGATG CCAAACATTA AAATATCTCG ACCCCTAAAA 720 

5 GTTCTGCTCA CAGTCGTTAT AGTTTTCATT GTCACTCAAC TGCCTTATAA CATTGTCAAG 780 

TTCTGCCGAG CCATAGACAT CATCTACTCC CTGATCACCA GCTGCAACAT GAGCAAACGC 84 0 

ATGGACATCG CCATCCAAGT CACAGAAAGC ATTGCACTCT TTCACAGCTG CCTCAACCCA 900 

ATCCTTTATG TTTTTATGGG AGCATCTTTC AAAAACTACG TTATGAAAGT GGCCAAGAAA 960 

TATGGGTCCT GGAGAAGACA GAGACAAAGT GTGGAGGAGT TTCCTTTTGA TTCTGAGGGT 1020 

10 CCTACAGAGC CAACCAGTAC TTTTAGCATT TAA 1053 
(25) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 350 amino acids 

(B) TYPE: amino acid 
15 (C) STRANDEDNESS : 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Met Ala Leu Glu Gin Asn Gin Ser Thr Asp Tyr Tyr Tyr Glu Glu Asn 
20 1 5 10 15 

Glu Met Asn Gly Thr Tyr Asp Tyr Ser Gin Tyr Glu Leu lie Cys lie 
20 25 30 

Lys Glu Asp Val Arg Glu Phe Ala Lys Val Phe Leu Pro Val Phe Leu 
35 40 45 

25 Thr He Ala Phe Val He Gly Leu Ala Gly Asn Ser Met Val Val Ala 

50 55 60 

He Tyr Ala Tyr Tyr Lys Lys Gin Arg Thr Lys Thr Asp Val Tyr He 
65 70 75 80 

Leu Asn Leu Ala Val Ala Asp Leu Leu Leu Leu Phe Thr Leu Pro Phe 
30 85 90 95 

Trp Ala Val Asn Ala Val His Gly Trp Val Leu Gly Lys He Met Cys 
100 105 110 

Lys He Thr Ser Ala Leu Tyr Thr Leu Asn Phe Val Ser Gly Met Gin 
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115 120 125 

Phe Leu Ala Cys lie Ser lie Asp Arg Tyr Val Ala Val Thr Asn Val 
130 135 140 

Pro Ser Gin Ser Gly Val Gly Lys Pro Cys Trp He He Cys Phe Cys 
5 145 150 155 160 

Val Trp Met Ala Ala He Leu Leu Ser He Pro Gin Leu Val Phe Tyr 
165 170 175 

Thr Val Asn Asp Asn Ala Arg Cys He Pro He Phe Pro Arg Tyr Leu 
180 185 190 

10 Gly Thr Ser Met Lys Ala Leu He Gin Met Leu Glu He Cys He Gly 

195 200 205 

Phe Val Val Pro Phe Leu He Met Gly Val Cys Tyr Phe He Thr Ala 
210 215 220 

Arg Thr Leu Met Lys Met Pro Asn He Lys He Ser Arg Pro Leu Lys 
15 225 230 235 240 

Val Leu Leu Thr Val Val He Val Phe He Val Thr Gin Leu Pro Tyr 
245 250 255 

Asn He Val Lys Phe Cys Arg Ala He Asp He He Tyr Ser Leu He 
260 265 270 

20 Thr Ser Cys Asn Met Ser Lys Arg Met Asp He Ala He Gin Val Thr 

275 280 285 

Glu Ser He Ala Leu Phe His Ser Cys Leu Asn Pro He Leu Tyr Val 
290 295 300 

Phe Met Gly Ala Ser Phe Lys Asn Tyr Val Met Lys Val Ala Lys Lys 
25 305 310 315 320 

Tyr Gly Ser Trp Arg Arg Gin Arg Gin Ser Val Glu Glu Phe Pro Phe 
325 330 335 

Asp Ser Glu Gly Pro Thr Glu Pro Thr Ser Thr Phe Ser He 
340 345 350 

30 (26) INFORMATION FOR SEQ ID NO: 25: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1116 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
35 (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 





ATGCCAGGAA 


ACGCCACCCC 


AGTGACCACC 


ACTGCCCCGT 


GGGCCTCCCT 


GGGCCTCTCC 


60 




GCCAAGACCT 


GCAACAACGT 


GTCCTTCGAA 


GAGAGCAGGA 


TAGTCCTGGT 


CGTGGTGTAC 


120 




AGCGCGGTGT 


GCACGCTGGG 


GGTGCCGGCC 


AACTGCCTGA 


CTGCGTGGCT 


GGCGCTGCTG 


180 


5 


CAGGTACTGC 


AGGGCAACGT 


GCTGGCCGTC 


TACCTGCTCT 


GCCTGGCACT 


CTGCGAACTG 


240 




CTGTACACAG 


GCACGCTGCC 


ACTCTGGGTC 


ATCTATATCC 


GCAACCAGCA 


CCGCTGGACC 


300 




CTAGGCCTGC 


TGGCCTCGAA 


GGTGACCGCC 


TACATCTTCT 


TCTGCAACAT 


CTACGTCAGC 


360 




ATCCTCTTCC 


TGTGCTGCAT 


CTCCTGCGAC 


CGCTTCGTGG 


CCGTGGTGTA 


CGCGCTGGAG 


420 




AGTCGGGGCC 


GCCGCCGCCG 


GAGGACCGCC 


ATCCTCATCT 


CCGCCTGCAT 


CTTCATCCTC 


480 


10, 


GTCGGGATCG 


TTCACTACCC 


GGTGTTCCAG 


ACGGAAGACA 


AGGAGACCTG 


CTTTGACATG 


540 




CTGCAGATGG 


ACAGCAGGAT 


TGCCGGGTAC 


TACTACGCCA 


GGTTCACCGT 


TGGCTTTGCC 


600 




ATCCCTCTCT 


CCATCATCGC 


CTTCACCAAC 


CACCGGATTT 


TCAGGAGCAT 


CAAGCAGAGC 


660 




ATGGGCTTAA 


GCGCTGCCCA 


GAAGGCCAAG 


GTGAAGCACT 


CGGCCATCGC 


GGTGGTTGTC 


720 




ATCTTCCTAG 


TCTGCTTCGC 


CCCGTACCAC 


CTGGTTCTCC 


TCGTCAAAGC 


CGCTGCCTTT 


780 


15 


TCCTACTACA 


GAGGAGACAG 


GAACGCCATG 


TGCGGCTTGG 






84 0 




TCTGTGGTGT 


TTCTGTGCCT 


GTCCACGGTG 


AACGGCGTGG 


CTGACCCCAT 


TATCTACGTG 


900 




CTGGCCACGG 


ACCATTCCCG 


CCAAGAAGTG 


TCCAGAATCC 


ATAAGGGGTG 


GAAAGAGTGG 


960 




TCCATGAAGA 


CAGACGTCAC 


CAGGCTCACC 


CACAGCAGGG 


ACACCGAGGA 


GCTGCAGTCG 


1020 




CCCGTGGCCC 


TTGCAGACCA 


CTACACCTTC 


TCCAGGCCCG 


TGCACCCACC 


AGGGTCACCA 


1080 


20 


TGCCCTGCAA 


AGAGGCTGAT 


TGAGGAGTCC 


TGCTGA 






1116 



(28) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 371 amino acids 

(B) TYPE: amino acid 
25 (C) STRANDEDNESS : 

(D) TOPOLOGY: not relevant 

(ii) MOLECtJLE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

Met Pro Gly Asn Ala Thr Pro Val Thr Thr Thr Ala Pro Trp Ala Ser 
15 10 15 
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Leu Gly Leu Ser Ala Lys Thr Cys Asn Asn Val Ser Phe Glu Glu Ser 
20 25 30 

Arg lie Val Leu Val Val Val Tyr Ser Ala Val Cys Thr Leu Gly Val 
35 40 45 



5 Pro Ala Asn Cys Leu Thr Ala Trp Leu Ala Leu Leu Gin Val Leu Gin 

50 55 60 

Gly Asn Val Leu Ala Val Tyr Leu Leu Cys Leu Ala Leu Cys Glu Leu 
65 70 75 80 

Leu Tyr Thr Gly Thr Leu Pro Leu Trp Val lie Tyr lie Arg Asn Gin 
10 85 90 95 

His Arg Trp Thr Leu Gly Leu Leu Ala Ser Lys Val Thr Ala Tyr lie 
100 105 110 

Phe Phe Cys Asn lie Tyr Val Ser lie Leu Phe Leu Cys Cys lie Ser 
115 120 125 

15 Cys Asp Arg Phe Val Ala Val Val Tyr Ala Leu Glu Ser Arg Gly Arg 

130 135 140 

Arg Arg Arg Arg Thr Ala lie Leu lie Ser Ala Cys lie Phe lie Leu 
145 150 155 160 

Val Gly lie Val His Tyr Pro Val Phe Gin Thr Glu Asp Lys Glu Thr 
20 165 170 175 

Cys Phe Asp Met Leu Gin Met Asp Ser Arg lie Ala Gly Tyr Tyr Tyr 
180 185 190 

Ala Arg Phe Thr Val Gly Phe Ala lie Pro Leu Ser lie lie Ala Phe 
195 200 205 

25 Thr Asn His Arg lie Phe Arg Ser lie Lys Gin Ser Met Gly Leu Ser 

210 215 220 

Ala Ala Gin Lys Ala Lys Val Lys His Ser Ala lie Ala Val Val Val 
225 230 235 240 

lie Phe Leu Val Cys Phe Ala Pro Tyr His Leu Val Leu Leu Val Lys 
30 245 250 255 

Ala Ala Ala Phe Ser Tyr Tyr Arg Gly Asp Arg Asn Ala Met Cys Gly 
260 265 270 

Leu Glu Glu Arg Leu Tyr Thr Ala Ser Val Val Phe Leu Cys Leu Ser 
275 280 285 

35 Thr Val Asn Gly Val Ala Asp Pro lie lie Tyr Val Leu Ala Thr Asp 

290 295 300 



AREN-0054 . 93 . 

His Ser Arg Gin Glu Val Ser Arg He His Lys Gly Trp Lys 
305 310 315 

Ser Met Lys Thr Asp Val Thr Arg Leu Thr His Ser Arg Asp 
325 330 

5 Glu Leu Gin Ser Pro Val Ala Leu Ala Asp His Tyr Thr Phe 

340 345 

Pro Val His Pro Pro Gly Ser Pro Cys Pro Ala Lys Arg Leu 
355 360 365 

Glu Ser Cys 
10 370 

(2 8) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1113 base pairs 

(B) TYPE: nucleic acid 
15 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 27: 





ATGGCGAACT 


ATAGCCATGC 


AGCTGACAAC 


ATTTTGCAAA 


ATCTCTCGCC 


TCTAACAGCC 


60 


20 


TTTCTGAAAC 


TGACTTCCTT 


GGGTTTCATA 


ATAGGAGTCA 


GCGTGGTGGG 


CAACCTCCTG 


120 




ATCTCCATTT 


TGCTAGTGAA 


AGATAAGACC 


TTGCATAGAG 


CACCTTACTA 


CTTCCTGTTG 


180 




GATCTTTGCT 


GTTCAGATAT 


CCTCAGATCT 


GCAATTTGTT 


TCCCATTTGT 


GTTCAACTCT 


240 




GTCAAAAATG 


GCTCTACCTG 


GACTTATGGG 


ACTCTGACTT 


GCAAAGTGAT 


TGCCTTTCTG 


300 




GGGGTTTTGT 


CCTGTTTCCA 


CACTGCTTTC 


ATGCTCTTCT 


GCATCAGTGT 


CACCAGATAC 


360 


25 


TTAGCTATCG 


CCCATCACCG 


CTTCTATACA 


AAGAGGCTGA 


CCTTTTGGAC 


GTGTCTGGCT 


420 




GTGATCTGTA 


TGGTGTGGAC 


TCTGTCTGTG 


GCCATGGCAT 


TTCCCCCGGT 


TTTAGACGTG 


480 




GGCACTTACT 


CATTCATTAG 


GGAGGAAGAT 


CAATGCACCT 


TCCAACACCG 


CTCCTTCAGG 


540 




GCTAATGATT 


CCTTAGGATT 


TATGCTGCTT 


CTTGCTCTCA 


TCCTCCTAGC 


CACACAGCTT 


600 




GTCTACCTCA 


AGCTGATATT 


TTTCGTCCAC 


GATCGAAGAA 


AAATGAAGCC 


AGTCCAGTTT 


660 


30 


GTAGCAGCAG 


TCAGCCAGAA 


CTGGACTTTT 


CATGGTCCTG 


GAGCCAGTGG 


CCAGGCAGCT 


720 




GCCAATTGGC 


TAGCAGGATT 


TGGAAGGGGT 


CCCACACCAC 


CCACCTTGCT 


GGGCATCAGG 


780 




CAAAATGCAA 


ACACCACAGG 


CAGAAGAAGG 


CTATTGGTCT 


TAGACGAGTT 


CAAAATGGAG 


840 



PATENT 

Glu Trp 
320 

Thr Glu 
335 

Ser Arg 
lie Glu 
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AAAAGAATCA GCAGAATGTT CTATATAATG ACTTTTCTGT TTCTAACCTT GTGGGGCCCC 900 

TACCTGGTGG CCTGTTATTG GAGAGTTTTT GCAAGAGGGC CTGTAGTACC AGGGGGATTT 960 

CTAACAGCTG CTGTCTGGAT GAGTTTTGCC CAAGCAGGAA TCAATCCTTT TGTCTGCATT 102 0 

TTCTCAAACA GGGAGCTGAG GCGCTGTTTC AGCACAACCC TTCTTTACTG CAGAAAATCC 10 80 

5 AGGTTACCAA GGGAACCTTA CTGTGTTATA TGA 1113 
(2 9) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 370 amino acids 

(B) TYPE: amino acid 
10 (C) STRANDEDNESS : 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

Met Ala Asn Tyr Ser His Ala Ala Asp Asn lie Leu Gin Asn Leu Ser 
15 1 5 10 15 

Pro Leu Thr Ala Phe Leu Lys Leu Thr Ser Leu Gly Phe lie lie Gly 
20 25 30 

Val Ser Val Val Gly Asn Leu Leu lie Ser lie Leu Leu Val Lys Asp 
35 40 45 



20 Lys Thr Leu His Arg Ala Pro Tyr Tyr Phe Leu Leu Asp Leu Cys Cys 

50 55 60 

Ser Asp lie Leu Arg Ser Ala lie Cys Phe Pro Phe Val Phe Asn Ser 

65 70 75 80 

Val Lys Asn Gly Ser Thr Trp Thr Tyr Gly Thr Leu Thr Cys Lys Val 
25 85 90 95 

lie Ala Phe Leu Gly Val Leu Ser Cys Phe His Thr Ala Phe Met Leu 
100 105 110 

Phe Cys lie Ser Val Thr Arg Tyr Leu Ala lie Ala His His Arg Phe 
115 120 125 

30 Tyr Thr Lys Arg Leu Thr Phe Trp Thr Cys Leu Ala Val lie Cys Met 

130 135 140 

Val Trp Thr Leu Ser Val Ala Met Ala Phe Pro Pro Val Leu Asp Val 

145 150 155 160 

Gly Thr Tyr Ser Phe lie Arg Glu Glu Asp Gin Cys Thr Phe Gin His 
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165 170 175 

Arg Ser Phe Arg Ala Asn Asp Ser Leu Gly Phe Met Leu Leu Leu Ala 
180 185 190 

Leu lie Leu Leu Ala Thr Gin Leu Val Tyr Leu Lys Leu lie Phe Phe 
5 195 200 205 

Val His Asp Arg Arg Lys Met Lys Pro Val Gin Phe Val Ala Ala Val 
210 215 220 

Ser Gin Asn Trp Thr Phe His Gly Pro Gly Ala Ser Gly Gin Ala Ala 
225 230 235 240 

10 Ala Asn Trp Leu Ala Gly Phe Gly Arg Gly Pro Thr Pro Pro Thr Leu 

245 250 255 

Leu Gly lie Arg Gin Asn Ala Asn Thr Thr Gly Arg Arg Arg Leu Leu 
260 265 270 

Val Leu Asp Glu Phe Lys Met Glu Lys Arg He Ser Arg Met Phe Tyr 
15 275 280 285 

He Met Thr Phe Leu Phe Leu Thr Leu Trp Gly Pro Tyr Leu Val Ala 
290 295 300 

Cys Tyr Trp Arg Val Phe Ala Arg Gly Pro Val Val Pro Gly Gly Phe 
305 310 315 320 

20 Leu Thr Ala Ala Val Trp Met Ser Phe Ala Gin Ala Gly He Asn Pro 

325 330 335 

Phe Val Cys He Phe Ser Asn Arg Glu Leu Arg Arg Cys Phe Ser Thr 
340 345 350 

Thr Leu Leu Tyr Cys Arg Lys Ser Arg Leu Pro Arg Glu Pro Tyr Cys 
25 355 360 365 

Val He 

^ 370 

(3 0) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 108 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 9 : 

ATGCAGGTCC CGAACAGCAC CGGCCCGGAC AACGCGACGC TGCAGATGCT GCGGAACCCG 



60 
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.C™o ^CCCT^CC .„CXO.T^ ec^COTCO CTCCCOOCC 

«ccTcr.cT ^cr^, acr^occ^ aocro^a c««t.ccc .tcotcatc 

TTC™.T« .CC^„cOT C.C=«CCT= .XS=T«CC. OCOTO^OC TTTC««TC 
TACTACCATT OC«CCa=CA CCACT<=«=TA TTCCOCO^ ^CT^CAA COTOOTOACC 
5 OT<=.CCTT„ ACOCA«„T OTATTCCA^ ATCCTCACCA XOACCTCTAT CAOCO^o 
CaCTTCCT<=0 .^TCCTOTA <:C=OCTCAac TCCAAOC.CT «=a=CCCCCO TCTTAC^CC 
OTOOCCOCCT OT^CAOGOAC CT«.CTCCT. CTCCI^CCC .CCrOTOCC OC^CCCOC 
ACC^TCTCA CCTACCCOGT CCACCCCCTO OOCATCATCA CCTCCTTCCA COTCCTCAAO 
TGCAC0AT3C TCCCCAOCOT OOCCAT^TOO CCCOTOTTCC TCTTCACCAT CTTCATCCTO 
.0 CTOTTCCTCA TCCCOTTCOT OATCACCaTO CCTTOTTACA C«KcaCC:AT CCTCAA0CT3 
™CCCACOO AO^ACOC^CA C==CC,«OAC CAOCGCA^^c .C=C<«.»^ cCT<=0=a3C3 
<n«OTCTT.C TCKCTTTCT CACC^CTTC .^CCaACA ACTTCGT^CT CCT«.COCAC 
ATCOTCA^CC OCCTOTTOTA O^CAAOAOC TACTACCACa TOTACAA^CT CAC^CTCHOT 

ctc:acctocc: tcaacaacto tct»3accco ttto™ a<™c<.tc cc«==aattc 

>5 CAOCTCCGCC TOC«=aAATA Tra^CMC CCCC0«3Tac CCAGAOACAC CCTCOACACa 

coccacoAOA cccTcrrcrc cocca-^oacc acotccot^ .ctcccagoc c:oot3c=cac lo.o 

CCT=AACOCA TOGAOGOAOC CACCACOCCC COCCTCCAaA <;OCAOOAaAO TOTGTTCTOA XOSO 

(31) INFORMATION FOR SEQ ID NO: 30: 



120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 



20 



25 



30 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 359 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

Met Gin Val Pro Asn Ser Thr Gly Pro Asp Asn Ala Thr Leu Gin Met 
5 10 ^5 

Leu Arg Asn Pro Ala He Ala Val Ala Leu Pro Val Val lyr Ser Leu 

^5 30 
val Ala Ala Val Ser lie Pro Gly Asn Leu Phe Ser Leu Trp Val Leu 



35 



40 



45 



Cys Ar< 
50 




Met Gly Pro Arg Ser Pro Ser 
55 



|Ile Phe Met lie Asn 
60 



Leu Ser Val Thr Asp Leu Met Leu Ala Ser Val Leu Pro Phe Gin lie 

65 70 75 80 

Tyr Tyr His Cys Asn Arg His His Trp Val Phe Gly Val Leu Leu Cys 

85 90 95 

Asn Val Val Thr Val Ala Phe Tyr Ala Asn Met Tyr Ser Ser lie Leu 

100 105 110 

Thr Met Thr Cys lie Ser Val Glu Arg Phe Leu Gly Val Leu Tyr Pro 



Leu Ser Ser Lys Arg Trp Arg Arg Arg Arg Tyr Ala Val Ala Ala Cys 
130 135 140 

Ala Gly Thr Trp Leu Leu Leu Leu Thr Ala Leu Cys Pro Leu Ala Arg 
145 150 155 16Q 

Thr Asp Leu Thr Tyr Pro Val His Ala Leu Gly He He Thr Cys Phe 
165 170 175 

Asp Val Leu Lys Trp Thr Met Leu Pro Ser Val Ala Met Trp Ala Val 
180 185 190 

Phe Leu Phe Thr He Phe He Leu Leu Phe Leu He Pro Phe Val He 
195 200 205 

Thr Val Ala Cys Tyr Thr Ala Thr He Leu Lys Leu Leu Arg Thr Glu 
210 215 220 

Glu Ala His Gly Arg Glu Gin Arg Arg Arg Ala Val Gly Leu Ala Ala 
225 230 235 240 

Val Val Leu Leu Ala Phe Val Thr Cys Phe Ala Pro Asn Asn Phe Val 
245 250 255 

Leu Leu Ala His He Val Ser Arg Leu Phe Tyr Gly Lys Ser Tyr Tyr 
260 265 270 

His Val Tyr Lys Leu Thr Leu Cys Leu Ser Cys Leu Asn Asn Cys Leu 
275 280 285 

Asp Pro Phe Val Tyr Tyr Phe Ala Ser Arg Glu Phe Gin Leu Arg Leu 
290 295 300 

Arg Glu Tyr Leu Gly Cys Arg Arg Val Pro Arg Asp Thr Leu Asp Thr 
305 310 315 320 

Arg Arg Glu Ser Leu Phe Ser Ala Arg Thr Thr Ser Val Arg Ser Glu 



115 



120 



125 



325 



330 



335 
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Ala Gly Ala His 
340 



Pro Glu Gly Met Glu Gly Ala Thr Arg Pro Gly Leu 
345 350 



Gin Arg Gin Glu 
355 



Ser Val Phe 



5 



(32) INFORMATION FOR 



SEQ ID NO: 31: 



(i) 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1503 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



10 



(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
ATGGAGCGTC CCTGGGAGGA CAGCCCAGGC CCGGAGGGGG CAGCTGAGGG CTCGCCTGTG 
CCAGTCGCCG CCGGGGCGCG CTCCGGTGCC GCGGCGAGTG GCACAGGCTG GCAGCCATGG 
15 GCTGAGTGCC CGGGACCCAA GGGGAGGGGG CAACTGCTGG CGACCGCCGG CCCTTTGCGT 
CGCTGGCCCG CCCCCTCGCC TGCCAGCTCC AGCCCCGCCC CCGGAGCGGC GTCCGCTCAC 
TCGGTTCAAG GCAGCGCGAC TGCGGGTGGC GCACGACCAG GGCGCAGACC TTGGGGCGCG 
CGGCCCATGG AGTCGGGGCT GCTGCGGCCG GCGCCGGTGA GCGAGGTCAT CGTCCTGCAT 
TACAACTACA CCGGCAAGCT CCGCGGTGCG AGCTACCAGC CGGGTGCCGG CCTGCGCGCC 
20 GACGCCGTGG TGTGCCTGGC GGTGTGCGCC TTCATCGTGC TAGAGAATCT AGCCGTGTTG 
TTGGTGCTCG GACGCCACCC GCGCTTCCAC GCTCCCATGT TCCTGCTCCT GGGCAGCCTC 
ACGTTGTCGG ATCTGCTGGC AGGCGCCGCC TACGCCGCCA ACATCCTACT GTCGGGGCCG 
CTCACGCTGA AACTGTCCCC CGCGCTCTGG TTCGCACGGG AGGGAGGCGT CTTCGTGGCA 
CTCACTGCGT CCGTGCTGAG CCTCCTGGCC ATCGCGCTGG AGCGCAGCCT CACCATGGCG 
25 CGCAGGGGGC CCGCGCCCGT CTCCAGTCGG GGGCGCACGC TGGCGATGGC AGCCGCGGCC 
TGGGGCGTGT CGCTGCTCCT CGGGCTCCTG CCAGCGCTGG GCTGGAATTG CCTGGGTCGC 
CTGGACGCTT GCTCCACTGT CTTGCCGCTC TACGCCAAGG CCTACGTGCT CTTCTGCGTG 
CTCGCCTTCG TGGGCATCCT GGCCGCGATC TGTGCACTCT ACGCGCGCAT CTACTGCCAG 
GTACGCGCCA ACGCGCGGCG CCTGCCGGCA CGGCCCGGGA CTGCGGGGAC CACCTCGACC 
30 CGGGCGCGTC GCAAGCCGCG CTCTCTGGCC TTGCTGCGCA CGCTCAGCGT GGTGCTCCTG 



60 
. 120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
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GCCTTTGTGG CATGTTGGGG CCCCCTCTTC CTGCTGCTGT TGCTCGACGT GGCGTGCCCG 114 0 

GCGCGCACCT GTCCTGTACT CCTGCAGGCC GATCCCTTCC TGGGACTGGC CATGGCCAAC 12 00 

TCACTTCTGA ACCCCATCAT CTACACGCTC ACCAACCGCG ACCTGCGCCA CGCGCTCCTG 12 60 

CGCCTGGTCT GCTGCGGACG CCACTCCTGC GGCAGAGACC CGAGTGGCTC CCAGCAGTCG 13 2 0 

5 GCGAGCGCGG CTGAGGCTTC CGGGGGCCTG CGCCGCTGCC TGCCCCCGGG CCTTGATGGG 13 80 

AGCTTCAGCG GCTCGGAGCG CTCATCGCCC CAGCGCGACG GGCTGGACAC CAGCGGCTCC 144 0 

ACAGGCAGCC CCGGTGCACC CACAGCCGCC CGGACTCTGG TATCAGAACC GGCTGCAGAC 15 00 
TGA 

(33) INFORMATION FOR SEQ ID NO: 32: 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 500 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: not relevant 

15 (ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: 

Met Glu Arg Pro Trp Glu Asp Ser Pro Gly Pro Glu Gly Ala Ala Glu 
15 10 15 

Gly Ser Pro Val Pro Val Ala Ala Gly Ala Arg Ser Gly Ala Ala Ala 
20 20 25 30 

Ser Gly Thr Gly Trp Gin Pro Trp Ala Glu Cys Pro Gly Pro Lys Gly 
35 40 45 

Arg Gly Gin Leu Leu Ala Thr Ala Gly Pro Leu Arg Arg Trp Pro Ala 
50 55 60 

25 Pro Ser Pro Ala Ser Ser Ser Pro Ala Pro Gly Ala Ala Ser Ala His 

65 70 75 80 

Ser Val Gin Gly Ser Ala Thr Ala Gly Gly Ala Arg Pro Gly Arg Arg 
85 90 95 

Pro Trp Gly Ala Arg Pro Met Glu Ser Gly Leu Leu Arg Pro Ala Pro 
30 100 105 110 

Val Ser Glu Val lie Val Leu His Tyr Asn Tyr Thr Gly Lys Leu Arg 
115 120 125 

Gly Ala Ser Tyr Gin Pro Gly Ala Gly Leu Arg Ala Asp Ala Val Val 
130 135 140 
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Cys Leu Ala Val 
145 

Leu Val Leu Gly 



5 Leu Gly Ser Leu 

180 

Ala Asn lie Leu 
195 



Cys Ala Phe lie 
150 

Arg His Pro Arg 
165 

Thr Leu Ser Asp 



Leu Ser Gly Pro 
200 



Val Leu Glu Asn 
155 

Phe His Ala Pro 
170 

Leu Leu Ala Gly 
185 

Leu Thr Leu Lys 



Leu Ala Val Leu 
160 

Met Phe Leu Leu 
175 

Ala Ala Tyr Ala 
190 

Leu Ser Pro Ala 
205 



Leu Trp Phe Ala Arg Glu Gly Gly Val Phe Val Ala Leu Thr Ala Ser 
10 210 215 220 

Val Leu Ser Leu Leu Ala lie Ala Leu Glu Arg Ser Leu Thr Met Ala 
225 230 235 240 

Arg Arg Gly Pro Ala Pro Val Ser Ser Arg Gly Arg Thr Leu Ala Met 
245 250 255 

15 Ala Ala Ala Ala Trp Gly Val Ser Leu Leu Leu Gly Leu Leu Pro Al^ 

260 265 270 

Leu Gly Trp Asn Cys Leu Gly Arg Leu Asp Ala Cys Ser Thr Val Leu 
275 280 285 

"■"^^ Pro Leu Tyr Ala Lys Ala Tyr Val Leu Phe Cys Val Leu Ala Phe Val 

i-^^ 20 290 295 300 

H PS? 

""'4 Gly lie Leu Ala Ala He Cys Ala Leu Tyr Ala Arg He Tyr Cys Gin 

□ 305 310 315 320 



Hi 



Val Arg Ala Asn Ala Arg Arg Leu Pro Ala Arg Pro Gly Thr Ala Gly 
325 330 335 

25 Thr Thr Ser Thr Arg Ala Arg Arg Lys Pro Arg Ser Leu Ala Leu Leu 

340 345 350 



Arg Thr Leu Ser 
355 

Leu Phe Leu Leu 
30 370 

Pro Val Leu Leu 
385 

Ser Leu Leu Asn 



35 His Ala Leu Leu 

420 



Val Val Leu Leu 
360 

Leu Leu Leu Asp 
375 

Gin Ala Asp Pro 
390 

Pro He He Tyr 
405 

Arg Leu Val Cys 



Ala Phe Val Ala 



Val Ala Cys Pro 
380 

Phe Leu Gly Leu 
395 

Thr Leu Thr Asn 
410 

Cys Gly Arg His 
425 



Cys Trp Gly Pro 
365 

Ala Arg Thr Cys 



Ala Met Ala Asn 
400 

Arg Asp Leu Arg 
415 

Ser Cys Gly Arg 
430 



Asp Pro Ser Gly Ser Gin Gin Ser Ala Ser Ala Ala Glu Ala Ser Gly 
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435 440 445 

Gly Leu Arg Arg Cys Leu Pro Pro Gly Leu Asp Gly Ser Phe Ser Gly 
450 455 460 

Ser Glu Arg Ser Ser Pro Gin Arg Asp Gly Leu Asp Thr Ser Gly Ser 
5 465 470 475 480 

Thr Gly Ser Pro Gly Ala Pro Thr Ala Ala Arg Thr Leu Val Ser Glu 
485 490 495 

Pro Ala Ala Asp 
500 

10 (34) INFORMATION FOR SEQ ID NO:33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1029 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
15 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

ATGCAAGCCG TCGACAATCT CACCTCTGCG CCTGGGAACA CCAGTCTGTG CACCAGAGAC 60 

TACAAAATCA CCCAGGTCCT CTTCCCACTG CTCTACACTG TCCTGTTTTT TGTTGGACTT 12 0 

20 ATCACAAATG GCCTGGCGAT GAGGATTTTC TTTCAAATCC GGAGTAAATC AAACTTTATT 180 

ATTTTTCTTA AGAACACAGT CATTTCTGAT CTTCTCATGA TTCTGACTTT TCCATTCAAA 240 

ATTCTTAGTG ATGCCAAACT GGGAACAGGA CCACTGAGAA CTTTTGTGTG TCAAGTTACC 3 00 

TCCGTCATAT TTTATTTCAC AATGTATATC AGTATTTCAT TCCTGGGACT GATAACTATC 3 60 

GATCGCTACC AGAAGACCAC CAGGCCATTT AAAACATCCA ACCCCAAAAA TCTCTTGGGG 420 

25 GCTAAGATTC TCTCTGTTGT CATCTGGGCA TTCATGTTCT TACTCTCTTT GCCTAACATG 4 80 

ATTCTGACCA ACAGGCAGCC GAGAGACAAG AATGTGAAGA AATGCTCTTT CCTTAAATCA 540 

GAGTTCGGTC TAGTCTGGCA TGAAATAGTA AATTACATCT GTCAAGTCAT TTTCTGGATT 600 

AATTTCTTAA TTGTTATTGT ATGTTATACA CTCATTACAA AAGAACTGTA CCGGTCATAC 660 

GTAAGAACGA GGGGTGTAGG TAAAGTCCCC AGGAAAAAGG TGAACGTCAA AGTTTTCATT 72 0 

30 ATCATTGCTG TATTCTTTAT TTGTTTTGTT CCTTTCCATT TTGCCCGAAT TCCTTACACC 780 

CTGAGCCAAA CCCGGGATGT CTTTGACTGC ACTGCTGAAA ATACTCTGTT CTATGTGAAA 84 0 
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GAGAGCACTC TGTGGTTAAC TTCCTTAAAT GCATGCCTGG ATCCGTTCAT CTATTTTTTC 900 
CTTTGCAAGT CCTTCAGAAA TTCCTTGATA AGTATGCTGA AGTGCCCCAA TTCTGCAACA 960 
TCTCTGTCCC AGGACAATAG GAAAAAAGAA CAGGATGGTG GTGACCCAAA TGAAGAGACT 102 0 
CCAATGTAA 

1029 

5 (3 5) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 342 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

10 (D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

Met Gin Ala Val Asp Asn Leu Thr Ser Ala Pro Gly Asn Thr Ser Leu- 



15 



15 10 15 

Cys Thr Arg Asp Tyr Lys He Thr Gin Val Leu Phe Pro Leu Leu Tyr 
20 25 30 

Thr Vai Leu Phe Phe Val Gly Leu He Thr Asn Gly Leu Ala Met Arg 
p 35 40 45 

Gin He Arg Ser Lys Ser Asn Phe He He Phe Leu Lvs 

li ^° ^0 

Asn Thr Val He Ser Asp Leu Leu Met He Leu Thr Phe Pro Phe Lys 
^5 70 75 80 

He Leu Ser Asp Ala Lys Leu Gly Thr Gly Pro Leu Arg Thr Phe Val 
85 90 95 

25 Cys Gin Val Thr Ser Val He Phe Tyr Phe Thr Met Tyr He Ser He 

100 105 110 

Ser Phe Leu Gly Leu He Thr He Asp Arg Tyr Gin Lys Thr Thr Arg 
115 120 125 

Pro Phe Lys Thr Ser Asn Pro Lys Asn Leu Leu Gly Ala Lys He Leu 
30 130 135 140 

Ser Val Val He Trp Ala Phe Met Phe Leu Leu Ser Leu Pro Asn Met 
145 150 155 160 

He Leu Thr Asn Arg Gin Pro Arg Asp Lys Asn Val Lys Lys Cys Ser 
165 170 175 



35 



Phe Leu Lys Ser Glu Phe Gly Leu Val Trp His Glu He Val Asn Tyr 



m 



m 

i n 
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180 185 190 

He Cys Gin Val He Phe Trp He Asn Phe Leu He Val He Val Cys 
195 200 205 

Tyr Thr Leu He Thr Lys Glu Leu Tyr Arg Ser Tyr Val Arg Thr Arg 
5 210 215 220 

Gly Val Gly Lys Val Pro Arg Lys Lys Val Asn Val Lys Val Phe He 
225 230 235 240 

He He Ala Val Phe Phe He Cys Phe Val Pro Phe His Phe Ala Arg 
245 250 255 

10 He Pro Tyr Thr Leu Ser Gin Thr Arg Asp Val Phe Asp Cys Thr Ala 

260 265 270 

Glu Asn Thr Leu Phe Tyr Val Lys Glu Ser Thr Leu Trp Leu Thr Ser 
275 280 285 

Leu Asn Ala Cys Leu Asp Pro Phe He Tyr Phe Phe Leu Cys Lys Ser 
15 290 295 300 

Phe Arg Asn Ser Leu He Ser Met Leu Lys Cys Pro Asn Ser Ala Thr 
305 310 315 320 

Ser Leu Ser Gin Asp Asn Arg Lys Lys Glu Gin Asp Gly Gly Asp Pro 
325 330 335 

20 Asn Glu Glu Thr Pro Met 

340 

(36) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1077 base pairs 
25 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:35: 

30 ATGTCGGTCT GCTACCGTCC CCCAGGGAAC GAGACACTGC TGAGCTGGAA GACTTCGCGG 60 

GCCACAGGCA CAGCCTTCCT GCTGCTGGCG GCGCTGCTGG GGCTGCCTGG CAACGGCTTC 12 0 

GTGGTGTGGA GCTTGGCGGG CTGGCGGCCT GCACGGGGGC GACCGCTGGC GGCCACGCTT 18 0 

GTGCTGCACC TGGCGCTGGC CGACGGCGCG GTGCTGCTGC TCACGCCGCT CTTTGTGGCC 24 0 

TTCCTGACCC GGCAGGCCTG GCCGCTGGGC CAGGCGGGCT GCAAGGCGGT GTACTACGTG 3 00 
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TGCGCGCTCA GCATGTACGC CAGCGTGCTG CTCACCGGCC TGCTCAGCCT GCAGCGCTGC 360 

CTCGCAGTCA CCCGCCCCTT CCTGGCGCCT CGGCTGCGCA GCCCGGCCCT GGCCCGCCGC 42 0 

CTGCTGCTGG CGGTCTGGCT GGCCGCCCTG TTGCTCGCCG TCCCGGCCGC CGTCTACCGC 4 80 

CACCTGTGGA GGGACCGCGT ATGCCAGCTG TGCCACCCGT CGCCGGTCCA CGCCGCCGCC 54 0 

5 CACCTGAGCC TGGAGACTCT GACCGCTTTC GTGCTTCCTT TCGGGCTGAT GCTCGGCTGC 60 0 

TACAGCGTGA CGCTGGCACG GCTGCGGGGC GCCCGCTGGG GCTCCGGGCG GCACGGGGCG 660 

CGGGTGGGCC GGCTGGTGAG CGCCATCGTG CTTGCCTTCG GCTTGCTCTG GGCCCCCTAC 72 0 

CACGCAGTCA ACCTTCTGCA GGCGGTCGCA GCGCTGGCTC CACCGGAAGG GGCCTTGGCG 780 

AAGCTGGGCG GAGCCGGCCA GGCGGCGCGA GCGGGAACTA CGGCCTTGGC CTTCTTCAGT 840 

10 TCTAGCGTCA ACCCGGTGCT CTACGTCTTC ACCGCTGGAG ATCTGCTGCC CCGGGCAGGT 900 

CCCCGTTTCC TCACGCGGCT CTTCGAAGGC TCTGGGGAGG CCCGAGGGGG CGGCCGCTCT 960 

AGGGAAGGGA CCATGGAGCT CCGAACTACC CCTCAGCTGA AAGTGGTGGG GCAGGGCCGC 1020 

GGCAATGGAG ACCCGGGGGG TGGGATGGAG AAGGACGGTC CGGAATGGGA CCTTTGA 1077 
(37) INFORMATION FOR SEQ ID NO: 36: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 358 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: not relevant 

20 (ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

Met Ser Val Cys Tyr Arg Pro Pro Gly Asn Glu Thr Leu Leu Ser Trp 
1 5 10 15 

Lys Thr Ser Arg Ala Thr Gly Thr Ala Phe Leu Leu Leu Ala Ala Leu 
25 20 25 30 

Leu Gly Leu Pro Gly Asn Gly Phe Val Val Trp Ser Leu Ala Gly Trp 
35 40 45 

Arg Pro Ala Arg Gly Arg Pro Leu Ala Ala Thr Leu Val Leu His Leu 
50 55 60 

30 Ala Leu Ala Asp Gly Ala Val Leu Leu Leu Thr Pro Leu Phe Val Ala 

65 70 75 80 

Phe Leu Thr Arg Gin Ala Trp Pro Leu Gly Gin Ala Gly Cys Lys Ala 



m 



AREN-0054 . 105 - PATENT 

85 90 95 

Val Tyr Tyr Val Cys Ala Leu Ser Met Tyr Ala Ser Val Leu Leu Thr 
100 105 

Gly Leu Leu Ser Leu Gin Arg Cys Leu Ala Val Thr Arg Pro Phe Leu 
5 115 120 125 

Ala Pro Arg Leu Arg Ser Pro Ala Leu Ala Arg Arg Leu Leu Leu Ala 
130 135 140 

Val Trp Leu Ala Ala Leu Leu Leu Ala Val Pro Ala Ala Val Tyr Arg 

150 155 160 



10 



20 



30 



His Leu Trp Arg Asp Arg Val Cys Gin Leu Cys His Pro Ser Pro Val 
165 170 175 

His Ala Ala Ala His Leu Ser Leu Glu Thr Leu Thr Ala Phe Val Leu 

180 185 190 



^ P^ro Phe Gly Leu Met Leu Gly Cys Tyr Ser Val Thr Leu Ala Arg Leu 

^ 15 195 200 205 

y Arg Gly Ala Arg Trp Gly Ser Gly Arg His Gly Ala Arg Val Gly Arg 

n 210 215 220 

y 

Leu Val Ser Ala lie Val Leu Ala Phe Gly Leu Leu Trp Ala Pro Tyr 
3 225 230 235 240 

His Ala Val Asn Leu Leu Gin Ala Val Ala Ala Leu Ala Pro Pro Glu 
245 250 255 

Gly Ala Leu Ala Lys Leu Gly Gly Ala Gly Gin Ala Ala Arg Ala Gly 
260 265 270 

Thr Thr Ala Leu Ala Phe Phe Ser Ser Ser Val Asn Pro Val Leu Tyr 
25 275 280 285 

Val Phe Thr Ala Gly Asp Leu Leu Pro Arg Ala Gly Pro Arg Phe Leu 
290 295 300 

Thr Arg Leu Phe Glu Gly Ser Gly Glu Ala Arg Gly Gly Gly Arg Ser 
305 310 315 320 



Arg Glu Gly Thr Met Glu Leu Arg Thr Thr Pro Gin Leu Lys Val Val 
325 330 335 

Gly Gin Gly Arg Gly Asn Gly Asp Pro Gly Gly Gly Met Glu Lys Asp 
340 345 350 

Gly Pro Glu Trp Asp Leu 
35 355 



(3 8) INFORMATION FOR SEQ ID NO: 37: 
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(i) SEQUENCE CHARACTERISTICS': 

(A) LENGTH: 1005 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
5 ( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:37: 

ATGCTGGGGA TCATGGCATG GAATGCAACT TGCAAAAACT GGCTGGCAGC AGAGGCTGCC 60 

CTGGAAAAGT ACTACCTTTC CATTTTTTAT GGGATTGAGT TCGTTGTGGG AGTCCTTGGA 12 0 

10 AATACCATTG TTGTTTACGG CTACATCTTC TCTCTGAAGA ACTGGAACAG CAGTAATATT 180 

TATCTCTTTA ACCTCTCTGT CTCTGACTTA GCTTTTCTGT GCACCCTCCC CATGCTGATA 240 

AGGAGTTATG CCAATGGAAA CTGGATATAT GGAGACGTGC TCTGCATAAG CAACCGATAT 300 

GTGCTTCATG CCAACCTCTA TACCAGCATT CTCTTTCTCA CTTTTATCAG CATAGATCGA - 3 60 

TACTTGATAA TTAAGTATCC TTTCCGAGAA CACCTTCTGC AAAAGAAAGA GTTTGCTATT 42 0 

15 TTAATCTCCT TGGCCATTTG GGTTTTAGTA ACCTTAGAGT TACTACCCAT ACTTCCCCTT 4 80 

ATAAATCCTG TTATAACTGA CAATGGCACC ACCTGTAATG ATTTTGCAAG TTCTGGAGAC 54 0 

CCCAACTACA ACCTCATTTA CAGCATGTGT CTAACACTGT TGGGGTTCCT TATTCCTCTT 600 

TTTGTGATGT GTTTCTTTTA TTACAAGATT GCTCTCTTCC TAAAGCAGAG GAATAGGCAG 660 

GTTGCTACTG CTCTGCCCCT TGAAAAGCCT CTCAACTTGG TCATCATGGC AGTGGTAATC 72 0 

20 TTCTCTGTGC TTTTTACACC CTATCACGTC ATGCGGAATG TGAGGATCGC TTCACGCCTG 780 

GGGAGTTGGA AGCAGTATCA GTGCACTCAG GTCGTCATCA ACTCCTTTTA CATTGTGACA 840 

CGGCCTTTGG CCTTTCTGAA CAGTGTCATC AACCCTGTCT TCTATTTTCT TTTGGGAGAT 900 

CACTTCAGGG ACATGCTGAT GAATCAACTG AGACACAACT TCAAATCCCT TACATCCTTT 960 

AGCAGATGGG CTCATGAACT CCTACTTTCA TTCAGAGAAA AGTGA 1005 

25 (3 9) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 334 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

30 (D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

Met Leu Gly lie Met Ala Trp Asn Ala Thr Cys Lys Asn Trp Leu Ala 
15 10 15 

Ala Glu Ala Ala Leu Glu Lys Tyr Tyr Leu Ser lie Phe Tyr Gly lie 
5 20 25 30 

Glu Phe Val Val Gly Val Leu Gly Asn Thr He Val Val Tyr Gly Tyr 
35 40 45 

He Phe Ser Leu Lys Asn Trp Asn Ser Ser Asn He Tyr Leu Phe Asn 

50 55 60 



10 Leu Ser Val Ser Asp Leu Ala Phe Leu Cys Thr Leu Pro Met Leu He 

65 70 75 80 

Arg Ser Tyr Ala Asn Gly Asn Trp He Tyr Gly Asp Val Leu Cys He 
85 90 95 



lU- 
ifi 



Ser Asn Arg Tyr Val Leu His Ala Asn Leu Tyr Thr Ser He Leu Phe 
15 100 105 110 

Leu Thr Phe He Ser He Asp Arg Tyr Leu He He Lys Tyr Pro Phe 
115 120 125 

Arg Glu His Leu Leu Gin Lys Lys Glu Phe Ala He Leu He Ser Leu 
130 135 140 

20 Ala He Trp Val Leu Val Thr Leu Glu Leu Leu Pro He Leu Pro Leu 

145 150 155 160 

He Asn Pro Val He Thr Asp Asn Gly Thr Thr Cys Asn Asp Phe Ala 
165 170 175 

Ser Ser Gly Asp Pro Asn Tyr Asn Leu He Tyr Ser Met Cys Leu Thr 
25 180 185 190 

Leu Leu Gly Phe Leu He Pro Leu Phe Val Met Cys Phe Phe Tyr Tyr 
195 200 205 

Lys He Ala Leu Phe Leu Lys Gin Arg Asn Arg Gin Val Ala Thr Ala 
210 215 220 

30 Leu Pro Leu Glu Lys Pro Leu Asn Leu Val He Met Ala Val Val He 

225 230 235 240 

Phe Ser Val Leu Phe Thr Pro Tyr His Val Met Arg Asn Val Arg He 
245 250 255 

Ala Ser Arg Leu Gly Ser Trp Lys Gin Tyr Gin Cys Thr Gin Val Val 
35 260 265 270 



He Asn Ser Phe Tyr He Val Thr Arg Pro Leu Ala Phe Leu Asn Ser 
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275 



280 



285 



Val lie Asn Pro Val Phe Tyr Phe Leu Leu Gly Asp His Phe Axg Asp 
290 295 300 



Met Leu Met Asn Gin Leu Arg His Asn Phe Lys Ser Leu Thr Ser Phe 
305 310 315 320 



Ser Arg Trp Ala His Glu Leu Leu Leu Ser Phe Arg Glu Lys 
325 330 



3"S 

m 

m 15 

III 

r =3.' 



(40) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQtJENCE CHARACTERISTICS: 

(A) LENGTH: 1296 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

ATGCAGGCGC TTAACATTAC CCCGGAGCAG TTCTCTCGGC TGCTGCGGGA CCACAACCTG 60 

ACGCGGGAGC AGTTCATCGC TCTGTACCGG CTGCGACCGC TCGTCTACAC CCCAGAGCTG 12 0 

CCGGGACGCG CCAAGCTGGC CCTCGTGCTC ACCGGCGTGC TCATCTTCGC CCTGGCGCTC 180 

TTTGGCAATG CTCTGGTGTT CTACGTGGTG ACCCGCAGCA AGGCCATGCG CACCGTCACC 240 

AACATCTTTA TCTGCTCCTT GGCGCTCAGT GACCTGCTCA TCACCTTCTT CTGCATTCCC 3 00 

GTCACCATGC TCCAGAACAT TTCCGACAAC TGGCTGGGGG GTGCTTTCAT TTGCAAGATG 3 60 

GTGCCATTTG TCCAGTCTAC CGCTGTTGTG ACAGAAATGC TCACTATGAC CTGCATTGCT 42 0 

GTGGAAAGGC ACCAGGGACT TGTGCATCCT TTTAAAATGA AGTGGCAATA CACCAACCGA 480 

AGGGCTTTCA CAATGCTAGG TGTGGTCTGG CTGGTGGCAG TCATCGTAGG ATCACCCATG 54 0 

TGGCACGTGC AACAACTTGA GATCAAATAT GACTTCCTAT ATGAAAAGGA ACACATCTGC 60 0 

TGCTTAGAAG AGTGGACCAG CCCTGTGCAC CAGAAGATCT ACACCACCTT CATCCTTGTC 660 

ATCCTCTTCC TCCTGCCTCT TATGGTGATG CTTATTCTGT ACAGTAAAAT TGGTTATGAA 72 0 

CTTTGGATAA AGAAAAGAGT TGGGGATGGT TCAGTGCTTC GAACTATTCA TGGAAAAGAA 7 80 

ATGTCCAAAA TAGCCAGGAA GAAGAAACGA GCTGTCATTA TGATGGTGAC AGTGGTGGCT 84 0 

CTCTTTGCTG TGTGCTGGGC ACCATTCCAT GTTGTCCATA TGATGATTGA ATACAGTAAT 900 

TTTGAAAAGG AATATGATGA TGTCACAATC AAGATGATTT TTGCTATCGT GCAAATTATT 960 



i y 
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GGATTTTCCA ACTCCATCTG TAATCCCATT GTCTATGCAT TTATGAATGA AAACTTCAAA 1020 

AAAAATGTTT TGTCTGCAGT TTGTTATTGC ATAGTAAATA AAACCTTCTC TCCAGCACAA 1080 

AGGCATGGAA ATTCAGGAAT TACAATGATG CGGAAGAAAG CAAAGTTTTC CCTCAGAGAG 114 0 

AATCCAGTGG AGGAAACCAA AGGAGAAGCA TTCAGTGATG GCAACATTGA AGTCAAATTG 1200 

5 TGTGAACAGA CAGAGGAGAA GAAAAAGCTC AAACGACATC TTGCTCTCTT TAGGTCTGAA 1260 

CTGGCTGAGA ATTCTCCTTT AGACAGTGGG CATTAA 12 96 

(41) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 431 amino acids 
10 (B) TYPE: amino acid 

( C ) STRANDEDNES S : 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

15 Met Gin Ala Leu Asn lie Thr Pro Glu Gin Phe Ser Arg Leu Leu Arg 

15 10 15 

Asp His Asn Leu Thr Arg Glu Gin Phe He Ala Leu Tyr Arg Leu Arg 
20 25 30 

Pro Leu Val Tyr Thr Pro Glu Leu Pro Gly Arg Ala Lys Leu Ala Leu 
20 35 40 45 

Val Leu Thr Gly Val Leu He Phe Ala Leu Ala Leu Phe Gly Asn Ala 
50 55 60 

Leu Val Phe Tyr Val Val Thr Arg Ser Lys Ala Met Arg Thr Val Thr 
65 70 75 80 

25 Asn He Phe He Cys Ser Leu Ala Leu Ser Asp Leu Leu He Thr Phe 

85 90 95 

Phe Cys He Pro Val Thr Met Leu Gin Asn He Ser Asp Asn Trp Leu 
100 105 110 

Gly Gly Ala Phe He Cys Lys Met Val Pro Phe Val Gin Ser Thr Ala 
30 115 120 125 

Val Val Thr Glu Met Leu Thr Met Thr Cys He Ala Val Glu Arg His 
130 135 140 

Gin Gly Leu Val His Pro Phe Lys Met Lys Trp Gin Tyr Thr Asn Arg 
145 150 155 160 



4 



ill 
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Arg Ala Phe Thr Met Leu Gly Val Val Trp Leu Val Ala Val He Val 
165 170 175 

Gly Ser Pro Met Trp His Val Gin Gin Leu Glu He Lys Tyr Asp Phe 
180 185 190 

5 Leu Tyr Glu Lys Glu His He Cys Cys Leu Glu Glu Trp Thr Ser Pro 

195 200 205 

Val His Gin Lys He Tyr Thr Thr Phe He Leu Val He Leu Phe Leu 
210 215 220 

Leu Pro Leu Met Val Met Leu He Leu Tyr Ser Lys He Gly Tyr Glu 

o-jA 235 240 

10 225 230 

Leu Trp He Lys Lys Arg Val Gly Asp Gly Ser Val Leu Arg Thr He 
245 250 255 

His Gly Lys Glu Met Ser Lys He Ala Arg Lys Lys Lys Arg Ala Val 
260 265 270 



15 



He Met Met Val Thr Val Val Ala Leu Phe Ala Val Cys Trp Ala Pro 



ri 275 280 285 



Phe His Val val His Met Met He Glu Tyr Ser Asn Phe Glu Lys Glu 



III 290 295 300 



Tyr Asp Asp Val Thr He Lys Met He Phe Ala He Val Gin He He 

m 20 305 310 315 320 

Gly Phe ser Asn Ser He Cys Asn Pro He Val Tyr Ala Phe Met Asn 

325 330 335 



Glu Asn Phe Lys Lys Asn Val Leu Ser Ala Val Cys Tyr Cys He Val 
340 345 350 

25 Asn Lys Thr Phe Ser Pro Ala Gin Arg His Gly Asn Ser Gly He Thr 

355 360 365 

Met Met Arg Lys Lys Ala Lys Phe Ser Leu Arg Glu Asn Pro Val Glu 
370 375 380 

Glu Thr Lys Gly Glu Ala Phe Ser Asp Gly Asn He Glu Val Lys Leu 
30 385 390 395 

Cys Glu Gin Thr Glu Glu Lys Lys Lys Leu Lys Arg His Leu Ala Leu 
405 410 

Phe Arg Ser Glu Leu Ala Glu Asn Ser Pro Leu Asp Ser Gly His 
420 425 430 

35 (42) INFORMATION FOR SEQ ID NO: 41: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 base pairs 



m 



3^5 



(44) INFORMATION FOR SEQ ID NO:43: 

(i) SEQtJENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 
20 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI- SENSE: NO 



24 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID N0:41: 

CTGTGTACAG CAGTTCGCAG AGTG 

(43) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 base pairs 
jQ (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQtJENCE DESCRIPTION: SEQ ID NO: 42: 



Ifl 24 
15 GAGTGCCAGG CAGAGCAGGT AGAC 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:43: 
25 CCCGAATTCC TGCTTGCTCC CAGCTTGGCC C 
(45) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 
30 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(iv) ANTI -SENSE: YES 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:44: 

TGTGGATCCT GCTGTCAAAG GTCCCATTCC GG 3 2 

(4 6) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
10 (iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:45: 
TCACAATGCT AGGTGTGGTC 2 0 

(47) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 22 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
20 (iv) ANTI -SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:46: 
TGCATAGACA ATGGGATTAC AG 22 

(48) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 511 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID N0:47: 

TCACAATGCT AGGTGTGGTC TGGCTGGTGG CAGTCATCGT AGGATCACCC ATGTGGCACG 60 
TGCAACAACT TGAGATCAAA TATGACTTCC TATATGAAAA GGAACACATC TGCTGCTTAG 120 
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AAGAGTGGAC CAGCCCTGTG CACCAGAAGA TCTACACCAC CTTCATCCTT GTCATCCTCT 18 0 

TCCTCCTGCC TCTTATGGTG ATGCTTATTC TGTACGTAAA ATTGGTTATG AACTTTGGAT 24 0 

AAAGAAAAGA GTTGGGGATG GTTCAGTGCT TCGAACTATT CATGGAAAAG AAATGTCCAA 3 00 

AATAGCCAGG AAGAAGAAAC GAGCTGTCAT TATGATGGTG ACAGTGGTGG CTCTCTTTGC 3 60 

5 TGTGTGCTGG GCACCATTCC ATGTTGTCCA TATGATGATT GAATACAGTA ATTTTGAAAA 42 0 

GGAATATGAT GATGTCACAA TCAAGATGAT TTTTGCTATC GTGCAAATTA TTGGATTTTC 4 80 

CAACTCCATC TGTAATCCCA TTGTCTATGC A 511 

(4 9) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQXJENCE CHARACTERISTICS: 
10 (A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
15 (iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

CTGCTTAGAA GAGTGGACCA G 21 

(50) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQXJENCE CHARACTERISTICS: 
20 (A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
25 (iv) ANTI- SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

CTGTGCACCA GAAGATCTAC AC 

(51) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 



s=0 
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(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI -SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50; 
CAAGGATGAA GGTGGTGTAG A 
5 (52) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
10 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iv) ANTI -SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 



in 

gtgtagatct tctggtgcac agg 

" 15 (53) information for seq id no:52- 

^3 

(i) SEQUENCE CHARACTERISTICS: 

O (A) LENGTH: 21 base pairs 

'Sl (B) TYPE: nucleic acid 

Q (C) STRANDEDNESS: single 

20 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52 
GCAATGCAGG TCATAGTGAG C 
(54) INFORMATION FOR SEQ ID NO: 53: 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

30 (ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: YES 
(iv) ANTI -SENSE: YES 



PATENT 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53 

TGGAGCATGG TGACGGGAAT GCAGAAG 

(55) INFORMATION FOR SEQ ID NO : 54 : 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
10 (iv) ANTI -SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 54 : 

GTGATGAGCA GGTCACTGAG CGCCAAG 

(56) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
20 (iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 
GCAATGCAGG CGCTTAACAT TAC 
(57) INFORMATION FOR SEQ ID NO: 56: 



PATENT 
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SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
30 (iv) ANTI -SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 
TTGGGTTACA ATCTGAAGGG CA 



(i) 



25 
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(58) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57 
10 ACTCCGTGTC CAGCAGGACT CTG 

(58) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 
15 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI -SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58 
20 TGCGTGTTCC TGGACCCTCA CGTG 

(58) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 base pairs 

(B) TYPE : nucleic acid 
25 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59 

30 CAGGCCTTGG ATTTTAATGT CAGGGATGG 

(61) INFORMATION FOR SEQ ID NO: 60: 

( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 2 7 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 
5 (iv) ANTI- SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60 
GGAGAGTCAG CTCTGAAAGA ATTCAGG 
(62) INFORMATION FOR SEQ ID NO: 61: 

SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
15 (iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 
TGATGTGATG CCAGATACTA ATAGCAC 
(63) INFORMATION FOR SEQ ID NO : 62 : 

SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
25 (iv) ANTI -SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 

CCTGATTCAT TTAGGTGAGA TTGAGAC 

(64) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(i) 



10 



(i) 
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(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 
CCCAAGCTTC CCCAGGTGTA TTTGAT 
(3) INFORMATION FOR SEQ ID NO: 63: 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 64 : 
GTTGGATCCA CATAATGCAT TTTCTC 
(66) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1080 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 



26 



ATGATTCTCA 


ACTCTTCTAC 


TGAAGATGGT 


ATTAAAAGAA 


TCCAAGATGA 


TTGTCCCAAA 


60 


GCTGGAAGGC 


ATAATTACAT 


ATTTGTCATG 


ATTCCTACTT 


TATACAGTAT 


CATCTTTGTG 


120 


GTGGGAATAT 


TTGGAAACAG 


CTTGGTGGTG 


ATAGTCATTT 


ACTTTTATAT 


GAAGCTGAAG 


180 


ACTGTGGCCA 


GTGTTTTTCT 


TTTGAATTTA 


GCACTGGCTG 


ACTTATGCTT 


TTTACTGACT 


240 


TTGCCACTAT 


GGGCTGTCTA 


CACAGCTATG 


GAATACCGCT 


GGCCCTTTGG 


CAATTACCTA 


300 


TGTAAGATTG 


CTTCAGCCAG 


CGTCAGTTTC 


AACCTGTACG 


CTAGTGTGTT 


TCTACTCACG 


360 


TGTCTCAGCA 


TTGATCGATA 


CCTGGCTATT 


GTTCACCCAA 


TGAAGTCCCG 


CCTTCGACGC 


420 


ACAATGCTTG 


TAGCCAAAGT 


CACCTGCATC 


ATCATTTGGC 


TGCTGGCAGG 


CTTGGCCAGT 


480 


TTGCCAGCTA 


TAATCCATCG 


AAATGTATTT 


TTCATTGAGA 


ACACCAATAT 


TACAGTTTGT 


540 


GCTTTCCATT 


ATGAGTCCCA 


AAATTCAACC 


CTTCCGATAG 


GGCTGGGCCT 


GACCAAAAAT 


600 
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ATACTGGGTT TCCTGTTTCC TTTTCTGATC ATTCTTACAA GTTATACTCT TATTTGGAAG 660 

GCCCTAAAGA AGGCTTATGA AATTCAGAAG AACAAACCAA GAAATGATGA TATTTTTAAG 720 

ATAATTATGG CAATTGTGCT TTTCTTITTC TTT.TCCTGGA TTCCCCACCA AATATTCACT 780 

TTTCTGGATG TATTGATTCA ACTAGGCATC ATACGTGACT GTAGAATTGC AGATATTGTG 840 

5 GACACGGCCA TGCCTATCAC CATTTGTATA GCTTATTTTA ACAATTGCCT GAATCCTCTT 900 

TTTTATGGCT TTCTGGGGAA AAAATTTAAA AGATATTTTC TCCAGCTTCT AAAATATATT 960 

CCCCCAAAAG CCAAATCCCA CTCAAACCTT TCAACAAAAA TGAGCACGCT TTCCTACCGC 1020 

CCCTCAGATA ATGTAAGCTC ATCCACCAAG AAGCCTGCAC CATGTTTTGA GGTTGAGTGA 1080 



10 



(67) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 359 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: not relevant 

15 (ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 66 : 

Met He Leu Asn Ser Ser Thr Glu Asp Gly He Lys Arg He Gin Asp 
1 5 10 15 

Asp Cys Pro Lys Ala Gly Arg His Asn Tyr He Phe Val Met He Pro 
20 20 25 30 

Thr Leu Tyr Ser He He Phe Val Val Gly He Phe Gly Asn Ser Leu 
35 40 45 

Val Val He Val He Tyr Phe Tyr Met Lys Leu Lys Thr Val Ala Ser 
50 55 60 

25 Val Phe Leu Leu Asn Leu Ala Leu Ala Asp Leu Cys Phe Leu Leu Thr 

^5 70 75 80 

Leu Pro Leu Trp Ala Val Tyr Thr Ala Met Glu Tyr Arg Trp Pro Phe 
85 90 95 

Gly Asn Tyr Leu Cys Lys He Ala Ser Ala Ser Val Ser Phe Asn Leu 
30 100 105 110 

Tyr Ala Ser Val Phe Leu Leu Thr Cys Leu Ser He Asp Arg Tyr Leu 
115 120 125 

Ala He Val His Pro Met Lys Ser Arg Leu Arg Arg Thr Met Leu Val 
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25 



130 



140 



Ala Lys Val Thr Cys He He He Trp Leu Leu Ala Gly Leu Ala Ser 

150 155 

Leu Pro Ala He He His Arg Asn Val Phe Phe He Glu Asn Thr Asn 
165 170 

He Thr val Cys Ala Phe His Tyr Glu Ser Gin Asn Ser Thr Leu Pro 
180 185 190 

He Gly Leu Gly Leu Thr Lys Asn He Leu Gly Phe Leu Phe Pro Phe 
195. 200 205 

Leu He He Leu Thr Ser Tyr Thr Leu He Trp Lys Ala Leu Lys Lys 
210 215 220 

13 "^"^ lie Gin Lys Asn Lys Pro Arg Asn Asp Asp He Phe Lys 

^0 225 230 235 240 

'i M 

'^4 _ Wet Ala He Val Leu Phe Phe Phe Phe Ser Trp He Pro His 

m 245 250 255 

ly 

m ^-^^ ^-^^ "^^^ Asp Val Leu He Gin Leu Gly He He Arg 

III 260 265 270 

13 Asp ^s Arg He Ala Asp He Val Asp Thr Ala Met Pro He Thr He 

ifi 275 2 80 2 85 



20 Cys He Ala Tyr Phe Asn Asn Cys Leu Asn Pro Leu Phe Tyr Gly Phe 

290 295 300 

Leu Gly Lys Lys Phe Lys Arg Tyr Phe Leu Gin Leu Leu Lys Tyr He 

310 315 320 

Pro Pro Lys Ala Lys Ser His Ser Asn Leu Ser Thr Lys Met Ser Thr 
325 330 335 

Leu Ser Tyr Arg Pro Ser Asp Asn Val Ser Ser Ser Thr Lys Lys Pro 
340 345 350 

Ala Pro Cys Phe Glu Val Glu 
355 

30 (68) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
35 ' (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67 
ACCATGGGCA GCCCCTGGAA CGGCAGC 
(69) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 
AGAACCACCA CCAGCAGGAC GCGGACGGTC TGCCGGTGG 
(70) INFORMATION FOR SEQ ID NO: 69: 

SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 
20 GTCCGCGTCC TGCTGGTGGT GGTTCTGGCA TTTATAATT 
(71) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 
25 (C) STRANDEDNESS: single 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 
CCTGGATCCT TATCCCATCG TCTTCACGTT AGC 



30 (72) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 



(i) 



15 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 

5 CTGGAATTCT CCTGCCAGCA TGGTGA 
26 

(73) INFORMATION FOR SEQ ID NO:72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 
10 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iv) ANTI -SENSE: YES 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 

• GCAGGATCCT ATATTGCGTG CTCTGTCCCC 
30 

(74) INFORMATION FOR SEQ ID NO:73: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 999 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:73: 

ATGGTGAACT CCACCCACCG TGGGATGCAC ACTTCTCTGC ACCTCTGGAA CCGCAGCAGT 60 

TACAGACTGC ACAGCAATGC CAGTGAGTCC CTTGGAAAAG GCTACTCTGA TGGAGGGTGC 12 0 

TACGAGCAAC TTTTTGTCTC TCCTGAGGTG TTTGTGACTC TGGGTGTCAT CAGCTTGTTG 180 

GAGAATATCT TAGTGATTGT GGCAATAGCC AAGAACAAGA ATCTGCATTC ACCCATGTAC 24 0 

30 TTTTTCATCT GCAGCTTGGC TGTGGCTGAT ATGCTGGTGA GCGTTTCAAA TGGATCAGAA 3 00 

ACCATTATCA TCACCCTATT AAACAGTACA GATACGGATG CACAGAGTTT CACAGTGAAT 360 

ATTGATAATG TCATTGACTC GGTGATCTGT AGCTCCTTGC TTGCATCCAT TTGCAGCCTG 42 0 
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CTTTCAATTG CAGTGGACAG GTACTTTACT ATCTTCTATG CTCTCCAGTA CCATAACATT 4 80 

ATGACAGTTA AGCGGGTTGG GATCAGCATA AGTTGTATCT GGGCAGCTTG CACGGTTTCA 54 0 

GGCATTTTGT TCATCATTTA CTCAGATAGT AGTGCTGTCA TCATCTGCCT CATCACCATG 600 

TTCTTCACCA TGCTGGCTCT CATGGCTTCT CTCTATGTCC ACATGTTCCT GATGGCCAGG 660 

5 CTTCACATTA AGAGGATTGC TGTCCTCCCC GGCACTGGTG . CCATCCGCCA AGGTGCCAAT 72 0 

ATGAAGGGAG CGATTACCTT GACCATCCTG ATTGGCGTCT TTGTTGTCTG CTGGGCCCCA 780 

TTCTTCCTCC ACTTAATATT CTACATCTCT TGTCCTCAGA ATCCATATTG TGTGTGCTTC 84 0 

ATGTCTCACT TTAACTTGTA TCTCATACTG ATCATGTGTA ATTCAATCAT CGATCCTCTG 900 

ATTTATGCAC TCCGGAGTCA AGAACTGAGG AAAACCTTCA AAGAGATCAT CTGTTGCTAT 960 

10 CCCCTGGGAG GCCTTTGTGA CTTGTCTAGC AGATATTAA 999 
(75) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 32 amino acids 

(B) TYPE: amino acid 
15 (C) STRANDEDNESS : 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 74 : 

Met Val Asn Ser Thr His Arg Gly Met His Thr Ser Leu His Leu Trp 
20 1 5 10 15 

Asn Arg Ser Ser Tyr Arg Leu His Ser Asn Ala Ser Glu Ser Leu Gly 
20 25 30 

Lys Gly Tyr Ser Asp Gly Gly Cys Tyr Glu Gin Leu Phe Val Ser Pro 
35 40 45 

25 Glu Val Phe Val Thr Leu Gly Val lie Ser Leu Leu Glu Asn lie Leu 

50 55 60 

Val lie Val Ala lie Ala Lys Asn Lys Asn Leu His Ser Pro Met Tyr 
65 70 75 80 

Phe Phe lie Cys Ser Leu Ala Val Ala Asp Met Leu Val Ser Val Ser 
30 85 90 95 

Asn Gly Ser Glu Thr lie lie lie Thr Leu Leu Asn Ser Thr Asp Thr 
100 105 110 

Asp Ala Gin Ser Phe Thr Val Asn lie Asp Asn Val lie Asp Ser Val 
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125 



He Cys Ser Ser Leu Leu Ala Ser He Cys Ser Leu Leu Ser He Ala 
^30 135 140 

Val Asp Arg Tyr Phe Thr He Phe Tyr Ala Leu Gin Tyr His Asn He 

150 155 160 

Met Thr Val Lys Arg Val Gly He Ser He Ser Cys He Trp Ala Ala 
165 170 175 

Cys Thr Val Ser Gly He Leu Phe He He Tyr Ser Asp Ser Ser Ala 
180 185 190 

Val He He Cys Leu He Thr Met Phe Phe Thr Met Leu Ala Leu Met 
195 200 205 

Ala Ser Leu Tyr Val His Met Phe Leu Met Ala Arg Leu His He Lys 
210 215 220 

Arg He Ala Val Leu Pro Gly Thr Gly Ala He Arg Gin Gly Ala Asn 
225 230 235 240 

Met Lys Gly Ala He Thr Leu Thr He Leu He Gly Val Phe Val Val 
245 250 255 

Cys Trp Ala Pro Phe Phe Leu His Leu He Phe Tyr He Ser Cys Pro 
260 265 270 

Gin Asn Pro Tyr Cys Val Cys Phe Met Ser His Phe Asn Leu Tyr Leu 
275 280 285 

He Leu He Met Cys Asn Ser He He Asp Pro Leu He Tyr Ala Leu 
290 295 300 

Arg Ser Gin Glu Leu Arg Lys Thr Phe Lys Glu He He Cys Cys Tyr 
305 310 315 320 

Pro Leu Gly Gly Leu Cys Asp Leu Ser Ser Arg Tyr 



325 



330 



(76) INFORMATION FOR SEQ ID NO: 75: 



30 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 



(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:75: 



CCGAAGCTTC GAGCTGAGTA AGGCGGCGGG CT 
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(77) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 
5 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) » SEQUENCE DESCRIPTION: SEQ ID N0:76: 
GTGGAATTCA TTTGCCCTGC CTCAACCCCC A 
10 (78) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1344 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
15 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 

ATGGAGCTGC TAAAGCTGAA CCGGAGCGTG CAGGGAACCG GACCCGGGCC GGGGGCTTCC 60 

CTGTGCCGCC CGGGGGCGCC TCTCCTCAAC AGCAGCAGTG TGGGCAACCT CAGCTGCGAG 12 0 

20 CCCCCTCGCA TTCGCGGAGC CGGGACACGA GAATTGGAGC TGGCCATTAG AATCACTCTT 180 

TACGCAGTGA TCTTCCTGAT GAGCGTTGGA GGAAATATGC TCATCATCGT GGTCCTGGGA 24 0 

CTGAGCCGCC GCCTGAGGAC TGTCACCAAT GCCTTCCTCC TCTCACTGGC AGTCAGCGAC 3 00 

CTCCTGCTGG CTGTGGCTTG CATGCCCTTC ACCCTCCTGC CCAATCTCAT GGGCACATTC 3 60 

ATCTTTGGCA CCGTCATCTG CAAGGCGGTT TCCTACCTCA TGGGGGTGTC TGTGAGTGTG 420 

25 TCCACGCTAA GCCTCGTGGC CATCGCACTG GAGCGATATA GCGCCATCTG CCGACCACTG 4 80 

CAGGCACGAG TGTGGCAGAC GCGCTCCCAC GCGGCTCGCG TGATTGTAGC CACGTGGCTG 54 0 

CTGTCCGGAC TACTCATGGT GCCCTACCCC GTGTACACTG TCGTGCAACC AGTGGGGCCT 600 

CGTGTGCTGC AGTGCGTGCA TCGCTGGCCC AGTGCGCGGG TCCGCCAGAC CTGGTCCGTA 660 

CTGCTGCTTC TGCTCTTGTT CTTCATCCCA GGTGTGGTTA TGGCCGTGGC CTACGGGCTT 720 

30 ATCTCTCGCG AGCTCTACTT AGGGCTTCGC TTTGACGGCG ACAGTGACAG CGACAGCCAA 780 

AGCAGGGTCC GAAACCAAGG CGGGCTGCCA GGGGCTGTTC ACCAGAACGG GCGTTGCCGG 84 0 



960 
1020 



1344 
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CCTGAGACTG GCGCGGTTGG CAAAGACAGC GATGGCTGCT ACGTGCAACT TCCACGTTCC 900 
CGGCCTGCCC TGGAGCTGAC GGCGCTGACG GCTCCTGGGC CGGGATCCGG CTCCCGGCCC 
ACCCAGGCCA AGCTGCTGGC TAAGAAGCGC GTGGTGCGAA TGTTGCTGGT GATCGTTGTG 

CTTTTTTTTC TGTGTTGGTT GCCAGTTTAT AGTGCCAACA CGTGGCGCGC CTTTGATGGC 108 0 

5 CCGGGTGCAC ACCGAGCACT CTCGGGTGCT CCTATCTCCT TCATTCACTT GCTGAGCTAC 114 0 

GCCTCGGCCT GTGTCAACCC CCTGGTCTAC TGCTTCATGC ACCGTCGCTT TCGCCAGGCC 12 0 0 

TGCCTGGAAA CTTGCGCTCG CTGCTGCCCC CGGCCTCCAC GAGCTCGCCC CAGGGCTCTT 1260 

CCCGATGAGG ACCCTCCCAC TCCCTCCATT GCTTCGCTGT CCAGGCTTAG CTACACCACC 132 0 
ATCAGCACAC TGGGCCCTGG CTGA 
10 (79) INFORMATION FOR SEQ ID NO: 78: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 447 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 
15 (D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 

Met Glu Leu Leu Lys Leu Asn Arg Ser Val Gin Gly Thr Gly Pro Gly 
^5 10 15 

20 Pro Gly Ala Ser Leu Cys Arg Pro Gly Ala Pro Leu Leu Asn Ser Ser 

20 25 30 

Ser Val Gly Asn Leu Ser Cys Glu Pro Pro Arg He Arg Gly Ala Gly 
35 40 45 

Thr Arg Glu Leu Glu Leu Ala He Arg He Thr Leu Tyr Ala Val He 
25 50 55 60 

Phe Leu Met Ser Val Gly Gly Asn Met Leu He He Val Val Leu Gly 
65 70 75 80 

Leu Ser Arg Arg Leu Arg Thr Val Thr Asn Ala Phe Leu Leu Ser Leu 
85 90 95 

30 Ala Val Ser Asp Leu Leu Leu Ala Val Ala Cys Met Pro Phe Thr Leu 

100 105 110 

Leu Pro Asn Leu Met Gly Thr Phe He Phe Gly Thr Val He Cys Lys 
115 120 125 
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Ala Val Ser Tyr Leu Met Gly Val Ser Val Ser Val Ser Thr Leu Ser 
130 135 140 

Leu Val Ala lie Ala Leu Glu Arg Tyr Ser Ala He Cys Arg Pro Leu 
145 150 155 160 

5 Gin Ala Arg Val Trp Gin Thr Arg Ser His Ala Ala Arg Val He Val 

165 170 175 

Ala Thr Trp Leu Leu Ser Gly Leu Leu Met Val Pro Tyr Pro Val Tyr 
180 185 190 

Thr Val Val Gin Pro Val Gly Pro Arg Val Leu Gin Cys Val His Arg 
10 195 200 205 

Trp Pro Ser Ala Arg Val Arg Gin Thr Trp Ser Val Leu Leu Leu Leu 
210 215 220 

kQ Leu Leu Phe Phe He Pro Gly Val Val Met Ala Val Ala Tyr Gly Leu 

225 230 235 240 



15 



He Ser Arg Glu Leu Tyr Leu Gly Leu Arg Phe Asp Gly Asp Ser Aap 



l\l 245 250 255 



Ser Asp Ser Gin Ser Arg Val Arg Asn Gin Gly Gly Leu Pro Gly Ala 
260 265 270 

Val His Gin Asn Gly Arg Cys Arg Pro Glu Thr Gly Ala Val Gly Lys 
20 275 280 285 

Asp Ser Asp Gly Cys Tyr Val Gin Leu Pro Arg Ser Arg Pro Ala Leu 
290 295 300 

Glu Leu Thr Ala Leu Thr Ala Pro Gly Pro Gly Ser Gly Ser Arg Pro 
305 310 315 320 

25 Thr Gin Ala Lys Leu Leu Ala Lys Lys Arg Val Val Arg Met Leu Leu 

325 330 335 

Val He Val Val Leu Phe Phe Leu Cys Trp Leu Pro Val Tyr Ser Ala 
340 345 350 

Asn Thr Trp Arg Ala Phe Asp Gly Pro Gly Ala His Arg Ala Leu Ser 
30 355 360 365 

Val Ala Pro He Ser Phe He His Leu Leu Ser Tyr Ala Ser Ala Cys 
370 375 380 

Val Asn Pro Leu Val Tyr Cys Phe Met His Arg Arg Phe Arg Gin Ala 
385 390 395 400 

35 Cys Leu Glu Thr Cys Ala Arg Cys Cys Pro Arg Pro Pro Arg Ala Arg 

405 410 415 



Pro Arg Ala Leu Pro Asp Glu Asp Pro Pro Thr Pro Ser He Ala Ser 
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420 425 430 

Leu Ser Arg Leu Ser Tyr Thr Thr lie Ser Thr Leu Gly Pro Gly 
435 440 445 



(80) INFORMATION FOR SEQ ID NO: 79: 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

10 (ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 
TGCAAGCTTA AAAAGGAAAA AATGAACAGC 30 
(81) INFORMATION FOR SEQ ID NO: 80: 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 

TAAGGATCCC TTCCCTTCAA AACATCCTTG 3 0 

(82) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1014 base pairs 
25 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 
30 ATGAACAGCA CATGTATTGA AGAACAGCAT GACCTGGATC ACTATTTGTT TCCCATTGTT 60 



TACATCTTTG TGATTATAGT CAGCATTCCA GCCAATATTG GATCTCTGTG TGTGTCTTTC 120 



CTGCAACCCA AGAAGGAAAG TGAACTAGGA ATTTACCTCT TCAGTTTGTC ACTATCAGAT 180 



TTACTCTATG CATTAACTCT CCCTTTATGG ATTGATTATA CTTGGAATAA AGACAACTGG 240 



^0 
IB 
Si 

m 
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ACTTTCTCTC CTGCCTTGTG CAAAGGGAGT GCTTTTCTCA TGTACATGAA GTTTTACAGC 3 00 

AGCACAGCAT TCCTCACCTG CATTGCCGTT GATCGGTATT TGGCTGTTGT CTACCCTTTG 3 60 

AAGTTTTTTT TCCTAAGGAC AAGAAGAATT GCACTCATGG TCAGCCTGTC CATCTGGATA 420 

TTGGAAACCA TCTTCAATGC TGTCATGTTG TGGGAAGATG AAACAGTTGT TGAATATTGC 4 80 

5 GATGCCGAAA AGTCTAATTT TACTTTATGC TATGACAAAT ACCCTTTAGA GAAATGGCAA 54 0 

ATCAACCTCA ACTTGTTCAG GACGTGTACA GGCTATGCAA TACCTTTGGT CACCATCCTG 600 

ATCTGTAACC GGAAAGTCTA CCAAGCTGTG CGGCACAATA AAGCCACGGA AAACAAGGAA 660 

AAGAAGAGAA TCATAAAACT ACTTGTCAGC ATCACAGTTA CTTTTGTCTT ATGCTTTACT 720 

CCCTTTCATG TGATGTTGCT GATTCGCTGC ATTTTAGAGC ATGCTGTGAA CTTCGAAGAC 7 80 

10 CACAGCAATT CTGGGAAGCG AACTTACACA ATGTATAGAA TCACGGTTGC ATTAACAAGT 840 

TTAAATTGTG TTGCTGATCC AATTCTGTAC TGTTTTGTTA CCGAAACAGG AAGATATGAT 900 

ATGTGGAATA TATTAAAATT CTGCACTGGG AGGTGTAATA CATCACAAAG ACAAAGAAAA 960 



CGCATACTTT CTGTGTCTAC AAAAGATACT ATGGAATTAG AGGTCCTTGA GTAG 1014 

" (83) INFORMATION FOR SEQ ID NO: 82: 

15 (i) SEQUENCE CHARACTERISTICS: 

Q (A) LENGTH: 3 37 amino acids 

S| (B) TYPE: amino acid 

^3 (C) STRANDEDNESS : 

(D) TOPOLOGY: not relevant 

20 (ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 

Met Asn Ser Thr Cys lie Glu Glu Gin His Asp Leu Asp His Tyr Leu 
15 10 15 

Phe Pro lie Val Tyr lie Phe Val lie lie Val Ser lie Pro Ala Asn 
25 20 25 30 

lie Gly Ser Leu Cys Val Ser Phe Leu Gin Pro Lys Lys Glu Ser Glu 
35 40 45 

Leu Gly lie Tyr Leu Phe Ser Leu Ser Leu Ser Asp Leu Leu Tyr Ala 
50 55 60 

30 Leu Thr Leu Pro Leu Trp He Asp Tyr Thr Trp Asn Lys Asp Asn Trp 

65 70 75 80 

Thr Phe Ser Pro Ala Leu Cys Lys Gly Ser Ala Phe Leu Met Tyr Met 
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85 90 95 

Lys Phe Tyr Ser Ser Thr Ala Phe Leu Thr Cys He Ala Val Asp Arg 
100 105 110 

Tyr Leu Ala Val Val Tyr Pro Leu Lys Phe Phe Phe Leu Arg Thr Arg 
5 115 120 125 

Arg He Ala Leu Met Val Ser Leu Ser He Trp He Leu Glu Thr He 
130 135 140 

Phe Asn Ala Val Met Leu Trp Glu Asp Glu Thr Val Val Glu Tyr Cys 
145 150 155 160 

10 Asp Ala Glu Lys Ser Asn Phe Thr Leu Cys Tyr Asp Lys Tyr Pro Leu 

165 170 175 

Glu Lys Trp Gin He Asn Leu Asn Leu Phe Arg Thr Cys Thr Gly Tyr 
180 185 190 

Ala He Pro Leu Val Thr He Leu He Cys Asn Arg Lys Val Tyr Gin 
15 195 200 205 

Ala Val Arg His Asn Lys Ala Thr Glu Asn Lys Glu Lys Lys Arg He 
210 215 220 

He Lys Leu Leu Val Ser He Thr Val Thr Phe Val Leu Cys Phe Thr 
225 230 235 240 

20 Pro Phe His Val Met Leu Leu He Arg Cys He Leu Glu His Ala Val 

245 250 255 

Asn Phe Glu Asp His Ser Asn Ser Gly Lys Arg Thr Tyr Thr Met Tyr 
260 265 270 

Arg He Thr Val Ala Leu Thr Ser Leu Asn Cys Val Ala Asp Pro He 
25 275 280 285 

Leu Tyr Cys Phe Val Thr Glu Thr Gly Arg Tyr Asp Met Trp Asn He 
290 295 300 

Leu Lys Phe Cys Thr Gly Arg Cys Asn Thr Ser Gin Arg Gin Arg Lys 
305 310 315 320 

30 Arg He Leu Ser Val Ser Thr Lys Asp Thr Met Glu Leu Glu Val Leu 

325 330 335 

Glu 



(84) INFORMATION FOR SEQ ID NO: 83: 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 0 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 



5 CAGGAAGAAG AAACGAGCTG TCATTATGAT GGTGACAGTG 
40 



(85) INFORMATION FOR SEQ ID NO : 84 : 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 40 base pairs 
10 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 



15 CACTGTCACC ATCATAATGA CAGCTCGTTT CTTCTTCCTG 
40 



(86) INFORMATION FOR SEQ ID NO: 85: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 30 base pairs 
20 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85 



25 GGCCACCGGC AGACCAAACG CGTCCTGCTG 
30 



(87) INFORMATION FOR SEQ ID NO: 86: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 31 base pairs 
30 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86 
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CTCCTTCGGT CCTCCTATCG TTGTCAGAAG T 
31 

(88) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



m 



m 



10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 

GGAAAAGAAG AGAATCAAAA AACTACTTGT CAGCATC 3 7 

(89) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 31 base pairs 
15 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 
20 CTCCTTCGGT CCTCCTATCG TTGTCAGAAG T 31 

(90) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1080 base pairs 

(B) TYPE: nucleic acid 
25 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 

ATGATTCTCA ACTCTTCTAC TGAAGATGGT ATTAAAAGAA TCCAAGATGA TTGTCCCAAA 60 

30 GCTGGAAGGC ATAATTACAT ATTTGTCATG ATTCCTACTT TATACAGTAT CATCTTTGTG 12 0 

GTGGGAATAT TTGGAAACAG CTTGGTGGTG ATAGTCATTT ACTTTTATAT GAAGCTGAAG 18 0 

ACTGTGGCCA GTGTTTTTCT TTTGAATTTA GCACTGGCTG ACTTATGCTT TTTACTGACT 24 0 



TTGCCACTAT GGGCTGTCTA CACAGCTATG GAATACCGCT GGCCCTTTGG CAATTACCTA 3 00 
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TGTAAGATTG CTTCAGCCAG CGTCAGTTTC AACCTGTACG CTAGTGTGTT TCTACTCACG 3 60 

TGTCTCAGCA TTGATCGATA CCTGGCTATT GTTCACCCAA TGAAGTCCCG CCTTCGACGC 42 0 

ACAATGCTTG TAGCCAAAGT CACCTGCATC ATCATTTGGC TGCTGGCAGG CTTGGCCAGT 4 80 

TTGCCAGCTA TAATCCATCG AAATGTATTT TTCATTGAGA ACACCAATAT TACAGTTTGT 54 0 

5 GCTTTCCATT ATGAGTCCCA AAATTCAACC CTTCCGATAG GGCTGGGCCT GACCAAAAAT 600 

ATACTGGGTT TCCTGTTTCC TTTTCTGATC ATTCTTACAA GTTATACTCT TATTTGGAAG 660 

GCCCTAAAGA AGGCTTATGA AATTCAGAAG AACAAACCAA GAAATGATGA TATTAAAAAG 72 0 

ATAATTATGG CAATTGTGCT TTTCTTTTTC TTTTCCTGGA TTCCCCACCA AATATTCACT 780 

TTTCTGGATG TATTGATTCA ACTAGGCATC ATACGTGACT GTAGAATTGC AGATATTGTG 840 

10 GACACGGCCA TGCCTATCAC CATTTGTATA GCTTATTTTA ACAATTGCCT GAATCCTCTT 900 

TTTTATGGCT TTCTGGGGAA AAAATTTAAA AGATATTTTC TCCAGCTTCT AAAATATATT 960 

CCCCCAAAAG CCAAATCCCA CTCAAACCTT TCAACAAAAA TGAGCACGCT TTCCTACCGC 1020 

CCCTCAGATA ATGTAAGCTC ATCCACCAAG AAGCCTGCAC CATGTTTTGA GGTTGAGTGA 1080 
(91) INFORMATION FOR SEQ ID NO: 90: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 59 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: not relevant 

20 (ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 

Met lie Leu Asn Ser Ser Thr Glu Asp Gly lie Lys Arg lie Gin Asp 
15 10 15 

Asp Cys Pro Lys Ala Gly Arg His Asn Tyr lie Phe Val Met lie Pro 
25 20 25 30 

Thr Leu Tyr Ser lie lie Phe Val Val Gly lie Phe Gly Asn Ser Leu 
35 40 45 

Val Val lie Val lie Tyr Phe Tyr Met Lys Leu Lys Thr Val Ala Ser 
50 55 60 

30 Val Phe Leu Leu Asn Leu Ala Leu Ala Asp Leu Cys Phe Leu Leu Thr 

65 70 75 80 

Leu Pro Leu Trp Ala Val Tyr Thr Ala Met Glu Tyr Arg Tirp Pro Phe 
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85 90 95 

Gly Asn Tyr Leu Cys Lys lie Ala Ser Ala Ser Val Ser Phe Asn Leu 
100 105 110 

Tyr Ala Ser Val Phe Leu Leu Thr Cys Leu Ser lie Asp Arg Tyr Leu 
5 115 120 125 

Ala lie Val His Pro Met Lys Ser Arg Leu Arg Arg Thr Met Leu Val 
130 135 140 

Ala Lys Val Thr Cys lie lie lie Trp Leu Leu Ala Gly Leu Ala Ser 
145 150 155 160 

10 Leu Pro Ala lie lie His Arg Asn Val Phe Phe lie Glu Asn Thr Asn 

165 170 175 

lie Thr Val Cys Ala Phe His Tyr Glu Ser Gin Asn Ser Thr Leu Pro 
180 185 190 

lie Gly Leu Gly Leu Thr Lys Asn lie Leu Gly Phe Leu Phe Pro Phe 
15 195 200 205 

Leu lie lie Leu Thr Ser Tyr Thr Leu lie Trp Lys Ala Leu Lys Lys 
210 215 220 

Ala Tyr Glu lie Gin Lys Asn Lys Pro Arg Asn Asp Asp lie Lys Lys 
225 230 235 240 

20 lie lie Met Ala lie Val Leu Phe Phe Phe Phe Ser Trp lie Pro His 

245 250 255 

Gin lie Phe Thr Phe Leu Asp Val Leu lie Gin Leu Gly lie lie Arg 
260 265 270 

Asp Cys Arg He Ala Asp He Val Asp Thr Ala Met Pro He Thr lie 
25 275 280 285 

Cys He Ala Tyr Phe Asn Asn Cys Leu Asn Pro Leu Phe Tyr Gly Phe 
290 295 300 

Leu Gly Lys Lys Phe Lys Arg Tyr Phe Leu Gin Leu Leu Lys Tyr He 
305 310 315 320 

30 Pro Pro Lys Ala Lys Ser His Ser Asn Leu Ser Thr Lys Met Ser Thr 

325 330 335 

Leu Ser Tyr Arg Pro Ser Asp Asn Val Ser Ser Ser Thr Lys Lys Pro 
340 345 350 

Ala Pro Cys Phe Glu Val Glu 
35 355 

(92) INFORMATION FOR SEQ ID NO: 91: 



ID 



ni 

m 
m 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
5 ' ( D ) TO POLOG Y : 1 i ne a r 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 
CCAAGAAATG ATGATATTAA AAAGATAATT ATGGC 35 

(93) INFORMATION FOR SEQ ID NO : 92 : 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

15 (ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 92 : 
CTCCTTCGGT CCTCCTATCG TTGTCAGAAG T 31 

(94) INFORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 1080 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 

ATGATTCTCA ACTCTTCTAC TGAAGATGGT ATTAAAAGAA TCCAAGATGA TTGTCCCAAA 60 

GCTGGAAGGC ATAATTACAT ATTTGTCATG ATTCCTACTT TATACAGTAT CATCTTTGTG 120 

GTGGGAATAT TTGGAAACAG CTTGGTGGTG ATAGTCATTT ACTTTTATAT GAAGCTGAAG 180 

ACTGTGGCCA GTGTTTTTCT TTTGAATTTA GCACTGGCTG ACTTATGCTT TTTACTGACT 24 0 

30 TTGCCACTAT GGGCTGTCTA CACAGCTATG GAATACCGCT GGCCCTTTGG CAATTACCTA 3 00 

TGTAAGATTG CTTCAGCCAG CGTCAGTTTC GCCCTGTACG CTAGTGTGTT TCTACTCACG 3 60 

TGTCTCAGCA TTGATCGATA CCTGGCTATT GTTCACCCAA TGAAGTCCCG CCTTCGACGC 42 0 
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ACAATGCTTG TAGCCAAAGT CACCTGCATC ATCATTTGGC TGCTGGCAGG CTTGGCCAGT 4 80 

TTGCCAGCTA TAATCCATCG AAATGTATTT TTCATTGAGA ACACCAATAT TACAGTTTGT 54 0 

GCTTTCCATT ATGAGTCCCA AAATTCAACC CTTCCGATAG GGCTGGGCCT GACCAAAAAT 600 

ATACTGGGTT TCCTGTTTCC TTTTCTGATC ATTCTTACAA GTTATACTCT TATTTGGAAG 660 

5 GCCCTAAAGA AGGCTTATGA AATTCAGAAG AACAAACCAA GAAATGATGA TATTTTTAAG 72 0 

ATAATTATGG CAATTGTGCT TTTCTTTTTC TTTTCCTGGA TTCCCCACCA AATATTCACT 7 80 

TTTCTGGATG TATTGATTCA ACTAGGCATC ATACGTGACT GTAGAATTGC AGATATTGTG 84 0 

GACACGGCCA TGCCTATCAC CATTTGTATA GCTTATTTTA ACAATTGCCT GAATCCTCTT 900 

TTTTATGGCT TTCTGGGGAA AAAATTTAAA AGATATTTTC TCCAGCTTCT AAAATATATT 960 

10 CCCCCAAAAG CCAAATCCCA CTCAAACCTT TCAACAAAAA TGAGCACGCT TTCCTACCGC 1020 

CCCTCAGATA ATGTAAGCTC ATCCACCAAG AAGCCTGCAC CATGTTTTGA GGTTGAGTGA 1080 

(95) INFORMATION FOR SEQ ID NO : 94 : 



15 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 359 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: not relevant 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 94 



20 



Met lie Leu Asn Ser Ser Thr Glu Asp Gly lie Lys Arg lie Gin Asp 
1 5 10 15 



Asp Cys Pro Lys Ala Gly Arg His Asn Tyr lie Phe Val Met lie Pro 
20 25 30 



25 



Thr Leu Tyr Ser lie lie Phe Val Val Gly lie Phe Gly Asn Ser Leu 
35 40 45 



Val Val lie Val lie Tyr Phe Tyr Met Lys Leu Lys Thr Val Ala Ser 
50 55 60 



Val Phe Leu Leu Asn Leu Ala Leu Ala Asp Leu Cys Phe Leu Leu Thr 
65 70 75 80 



30 



Leu Pro Leu Trp Ala Val Tyr Thr Ala Met Glu Tyr Arg Trp Pro Phe 
85 90 95 



Gly Asn Tyr Leu Cys Lys lie Ala Ser Ala Ser Val Ser Phe Ala Leu 
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100 105 110 

Tyr Ala Ser Val Phe Leu Leu Thr Cys Leu Ser lie Asp Arg Tyr Leu 
115 120 125 

Ala He Val His Pro Met Lys Ser Arg Leu Arg Arg Thr Met Leu Val 
5 130 135 140 

Ala Lys Val Thr Cys He lie He Trp Leu Leu Ala Gly Leu Ala Ser 
145 150 155 160 

Leu Pro Ala He He His Arg Asn Val Phe Phe He Glu Asn Thr Asn 
165 170 175 

10 He Thr Val Cys Ala Phe His Tyr Glu Ser Gin Asn Ser Thr Leu Pro 

180 185 190 

He Gly Leu Gly Leu Thr Lys Asn He Leu Gly Phe Leu Phe Pro Phe 
195 200 205 

Leu He He Leu Thr Ser Tyr Thr Leu He Trp Lys Ala Leu Lys Lys 
15 210 215 220 

Ala Tyr Glu He Gin Lys Asn Lys Pro Arg Asn Asp Asp He Phe Lys 
225 230 235 240 

He He Met Ala He Val Leu Phe Phe Phe Phe Ser Trp He Pro His 
245 250 255 

20 Gin He Phe Thr Phe Leu Asp Val Leu He Gin Leu Gly He He Arg 

260 265 270 

Asp Cys Arg He Ala Asp He Val Asp Thr Ala Met Pro He Thr He 
275 280 285 

Cys He Ala Tyr Phe Asn Asn Cys Leu Asn Pro Leu Phe Tyr Gly Phe 
25 290 295 300 

Leu Gly Lys Lys Phe Lys Arg Tyr Phe Leu Gin Leu Leu Lys Tyr He 
305 310 315 320 

Pro Pro Lys Ala Lys Ser His Ser Asn Leu Ser Thr Lys Met Ser Thr 
325 330 335 

30 Leu Ser Tyr Arg Pro Ser Asp Asn Val Ser Ser Ser Thr Lys Lys Pro 

340 345 350 

Ala Pro Cys Phe Glu Val Glu 
355 

(97) INFORMATION FOR SEQ ID NO: 95: 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

( D ) TOPOLOGY ; 1 inear 



(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI- SENSE: NO 



5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95: 

CCCAAGCTTC CCCAGGTGTA TTTGAT 

(97) INFORMATION FOR SEQ ID NO: 96: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 9 base pairs 
10 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI -SENSE: YES 



15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 

CCTGCAGGCG AAACTGACTC TGGCTGAAG 

(98) INFORMATION FOR SEQ ID NO: 97: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 42 base pairs 
20 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI-SENSE: NO 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 

CTGTACGCTA GTGTGTTTCT ACTCACGTGT CTCAGCATTG AT 

(99) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 6 base pairs 
30 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 
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(iv) ANTI- SENSE: YES 

(xi) SEQtJENCE DESCRIPTION: SEQ ID NO: 98: 

GTTGGATCCA CATAATGCAT TTTCTC 26 
(100) INFORMATION FOR SEQ ID NO: 99: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1080 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 

ATGATTCTCA ACTCTTCTAC TGAAGATGGT ATTAAAAGAA TCCAAGATGA TTGTCCCAAA 60 

GCTGGAAGGC ATAATTACAT ATTTGTCATG ATTCCTACTT TATACAGTAT CATCTTTGTG 12 0 

GTGGGAATAT TTGGAAACAG CTTGGTGGTG ATAGTCATTT ACTTTTATAT GAAGCTGAAG 180 

ACTGTGGCCA GTGTTTTTCT TTTGAATTTA GCACTGGCTG ACTTATGCTT TTTACTGACT 24 0 

TTGCCACTAT GGGCTGTCTA CACAGCTATG GAATACCGCT GGCCCTTTGG CAATTACCTA 3 00 

TGTAAGATTG CTTCAGCCAG CGTCAGTTTC AACCTGTACG CTAGTGTGTT TCTACTCACG 3 60 

TGTCTCAGCA TTGATCGATA CCTGGCTATT GTTCACCCAA TGAAGTCCCG CCTTCGACGC 42 0 

ACAATGCTTG TAGCCAAAGT CACCTGCATC ATCATTTGGC TGCTGGCAGG CTTGGCCAGT' 4 80 

TTGCCAGCTA TAATCCATCG AAATGTATTT TTCATTGAGA ACACCAATAT TACAGTTTGT 54 0 

GCTTTCCATT ATGAGTCCCA AAATTCAACC CTTCCGATAG GGCTGGGCCT GACCAAAAAT 600 

ATACTGGGTT TCCTGTTTCC TTTTCTGATC ATTCTTACAA GTTATTTTGG AATTCGAAAA 660 

CACTTACTGA AGACGAATAG CTATGGGAAG AACAGGATAA CCCGTGACCA AGTTAAGAAG 72 0 

ATAATTATGG CAATTGTGCT TTTCTTTTTC TTTTCCTGGA TTCCCCACCA AATATTCACT 780 

TTTCTGGATG TATTGATTCA ACTAGGCATC ATACGTGACT GTAGAATTGC AGATATTGTG 840 

GACACGGCCA TGCCTATCAC CATTTGTATA GCTTATTTTA ACAATTGCCT GAATCCTCTT 900 

TTTTATGGCT TTCTGGGGAA AAAATTTAAA AGATATTTTC TCCAGCTTCT AAAATATATT 960 

CCCCCAAAAG CCAAATCCCA CTCAAACCTT TCAACAAAAA TGAGCACGCT TTCCTACCGC 1020 

CCCTCAGATA ATGTAAGCTC ATCCACCAAG AAGCCTGCAC CATGTTTTGA GGTTGAGTGA 1080 



AREN-0054 



-140- 



PATENT 



(101) INFORMATION FOR SEQ ID NO: 100: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 359 amino acids 

(B) TYPE: amino acid 
5 (C) STRANDEDNESS : 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 

Met lie Leu Asn Ser Ser Thr Glu Asp Gly lie Lys Arg lie Gin Asp 
10 1 5 10 15 

Asp Cys Pro Lys Ala Gly Arg His Asn Tyr lie Phe Val Met lie Pro 
20 25 30 

Thr Leu Tyr Ser lie lie Phe Val Val Gly lie Phe Gly Asn Ser Leu 
35 40 45 

15 Val Val lie Val lie Tyr Phe Tyr Met Lys Leu Lys Thr Val Ala Ser 

50 55 60 

Val Phe Leu Leu Asn Leu Ala Leu Ala Asp Leu Cys Phe Leu Leu Thr 
65 70 75 80 

Leu Pro Leu Trp Ala Val Tyr Thr Ala Met Glu Tyr Arg Trp Pro Phe 
20 85 90 95 

Gly Asn Tyr Leu Cys Lys lie Ala Ser Ala Ser Val Ser Phe Asn Leu 
100 105 110 

Tyr Ala Ser Val Phe Leu Leu Thr Cys Leu Ser lie Asp Arg Tyr Leu 
115 120 125 

25 Ala lie Val His Pro Met Lys Ser Arg Leu Arg Arg Thr Met Leu Val 

130 135 140 

Ala Lys Val Thr Cys lie lie lie Trp Leu Leu Ala Gly Leu Ala Ser 
145 150 155 160 

Leu Pro Ala lie lie His Arg Asn Val Phe Phe lie Glu Asn Thr Asn 
30 165 170 175 

lie Thr Val Cys Ala Phe His Tyr Glu Ser Gin Asn Ser Thr Leu Pro 
180 185 190 

lie Gly Leu Gly Leu Thr Lys Asn lie Leu Gly Phe Leu Phe Pro Phe 
195 200 205 



35 



Leu lie lie Leu Thr Ser Tyr Phe Gly lie Arg Lys His Leu Leu Lys 
210 215 220 
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Thr Asn Ser Tyr Gly Lys Asn Arg lie Thr Arg Asp Gin Val Lys Lys 
225 230 235 240 

lie lie Met Ala lie Val Leu Phe Phe Phe Phe Ser Trp lie Pro His 
245 250 255 

5 Gin lie Phe Thr Phe Leu Asp Val Leu lie Gin Leu Gly lie lie Arg 

260 265 270 

Asp Cys Arg lie Ala Asp lie Val Asp Thr Ala Met Pro lie Thr lie 
275 280 285 

Cys lie Ala Tyr Phe Asn Asn Cys Leu Asn Pro Leu Phe Tyr Gly Phe 
10 290 295 300 

Leu Gly Lys Lys Phe Lys Arg Tyr Phe Leu Gin Leu Leu Lys Tyr He 
305 310 315 320 

Pro Pro Lys Ala Lys Ser His Ser Asn Leu Ser Thr Lys Met Ser Thr 
325 330 335 

15 Leu Ser Tyr Arg Pro Ser Asp Asn Val Ser Ser Ser Thr Lys Lys Pro 

340 345 350 

Ala Pro Cys Phe Glu Val Glu 
355 

(102) INFORMATION FOR SEQ ID NO: 101: 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: DNA (genomic) 

(iv) ANTI-SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 
TCCGAATTCC AAAATAACTT GTAAGAATGA TCAGAAA 3 7 

(103) INFORMATION FOR SEQ ID NO: 102: 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

35 (ii) MOLECULE TYPE: DNA (genomic) 



(iv) ANTI- SENSE: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102: 
AGATCTTAAG AAGATAATTA TGGCAATTGT GCT 33 

(104) INFORMATION FOR SEQ ID NO: 103: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 62 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
10 (iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 
AATTCGAAAA CACTTACTGA AGACGAATAG CTATGGGAAG AACAGGATAA CCCGTGACCA 60 
AG 62 

(105) INFORMATION FOR SEQ ID NO: 104: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

20 (ii) MOLECULE TYPE: DNA (genomic) 

(iv) ANTI -SENSE: YES 

(xi) SEQtJENCE DESCRIPTION: SEQ ID NO: 104: 
TTAACTTGGT CACGGGTTAT CCTGTTCTTC CCATAGCTAT TCGTCTTCAG TAAGTGTTTT 60 
CG 62 
25 (106) INFORMATION FOR SEQ ID NO: 105: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1083 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
30 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 
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ATGATTCTCA ACTCTTCTAC TGAAGATGGT ATTAAAAGAA TCCAAGATGA TTGTCCCAAA 60 

GCTGGAAGGC ATAATTACAT ATTTGTCATG ATTCCTACTT TATACAGTAT CATCTTTGTG 12 0 

GTGGGAATAT TTGGAAACAG CTTGGTGGTG ATAGTCATTT ACTTTTATAT GAAGCTGAAG 180 

ACTGTGGCCA GTGTTTTTCT TTTGAATTTA GCACTGGCTG ACTTATGCTT TTTACTGACT 240 

5 TTGCCACTAT GGGCTGTCTA CACAGCTATG GAATACCGCT GGCCCTTTGG CAATTACCTA 3 00 

TGTAAGATTG CTTCAGCCAG CGTCAGTTTC AACCTGTACG CTAGTGTGTT TCTACTCACG 3 60 

TGTCTCAGCA TTGATCGATA CCTGGCTATT GTTCACCCAA TGAAGTCCCG CCTTCGACGC 42 0 

ACAATGCTTG TAGCCAAAGT CACCTGCATC ATCATTTGGC TGCTGGCAGG CTTGGCCAGT 4 80 

TTGCCAGCTA TAATCCATCG AAATGTATTT TTCATTGAGA ACACCAATAT TACAGTTTGT 54 0 

!□ 10 GCTTTCCATT ATGAGTCCCA AAATTCAACC CTTCCGATAG GGCTGGGCCT GACCAAAAAT 600 

m ATACTGGGTT TCCTGTTTCC TTTTCTGATC ATTCTTACAA GTTATACTCT TATTTGGAAG 660 

Ifl GCCCTAAAGA AGGCTTATGA AATTCAGAAG AACAAACCAA GAAATGATGA TATTTTTAAG 72 0 

s y 

If] ATAATTATGG CAGCAATTGT GCTTTTCTTT TTCTTTTCCT GGATTCCCCA CCAAATATTC 780 

l"' ACTTTTCTGG ATGTATTGAT TCAACTAGGC ATCATACGTG ACTGTAGAAT TGCAGATATT 84 0 

iS 15 GTGGACACGG CCATGCCTAT CACCATTTGT ATAGCTTATT TTAACAATTG CCTGAATCCT 900 

CTTTTTTATG GCTTTCTGGG GAAAAAATTT AAAAGATATT TTCTCCAGCT TCTAAAATAT 960 

ATTCCCCCAA AAGCCAAATC CCACTCAAAC CTTTCAACAA AAATGAGCAC GCTTTCCTAC 102 0 

CGCCCCTCAG ATAATGTAAG CTCATCCACC AAGAAGCCTG CACCATGTTT TGAGGTTGAG 10 80 

TGA 1083 
20 (107) INFORMATION FOR SEQ ID NO: 106: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 60 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 
25 (D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106: 

Met lie Leu Asn Ser Ser Thr Glu Asp Gly lie Lys Arg lie Gin Asp 
15 10 15 

30 Asp Cys Pro Lys Ala Gly Arg His Asn Tyr lie Phe Val Met lie Pro 
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20 25 30 

Thr Leu Tyr Ser lie lie Phe Val Val Gly lie Phe Gly Asn Ser Leu 
35 40 45 

Val Val lie Val lie Tyr Phe Tyr Met Lys Leu Lys Thr Val Ala Ser 
5 50 55 60 

Val Phe Leu Leu Asn Leu Ala Leu Ala Asp Leu Cys Phe Leu Leu Thr 
65 70 75 80 

Leu Pro Leu Trp Ala Val Tyr Thr Ala Met Glu Tyr Arg Trp Pro Phe 
85 90 95 

10 Gly Asn Tyr Leu Cys Lys lie Ala Ser Ala Ser Val Ser Phe Asn Leu 

100 105 110 

Tyr Ala Ser Val Phe Leu Leu Thr Cys Leu Ser lie Asp Arg Tyr Leu 
115 120 125 

Ala lie Val His Pro Met Lys Ser Arg Leu Arg Arg Thr Met Leu Val 
15 130 135 140 



p; Ala Lys Val Thr Cys lie lie lie Trp Leu Leu Ala Gly Leu Ala Ser 

145 150 155 160 



Leu Pro Ala lie lie His Arg Asn Val Phe Phe lie Glu Asn Thr Asn 
165 170 175 

20 He Thr Val Cys Ala Phe His Tyr Glu Ser Gin Asn Ser Thr Leu Pro 

180 185 190 

He Gly Leu Gly Leu Thr Lys Asn He Leu Gly Phe Leu Phe Pro Phe 
195 200 205 

Leu lie He Leu Thr Ser Tyr Thr Leu He Trp Lys Ala Leu Lys Lys 
25 210 215 220 

Ala Tyr Glu He Gin Lys Asn Lys Pro Arg Asn Asp Asp He Phe Lys 
225 230 235 240 

He He Met Ala Ala He Val Leu Phe Phe Phe Phe Ser Trp He Pro 
245 250 255 

30 His Gin He Phe Thr Phe Leu Asp Val Leu He Gin Leu Gly He He 

260 265 270 

Arg Asp Cys Arg He Ala Asp He Val Asp Thr Ala Met Pro He Thr 
275 280 285 

He Cys He Ala Tyr Phe Asn Asn Cys Leu Asn Pro Leu Phe Tyr Gly 
35 290 295 300 



Phe Leu Gly Lys Lys Phe Lys Arg Tyr Phe Leu Gin Leu Leu Lys Tyr 
305 310 315 320 
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He Pro Pro Lys Ala Lys Ser His Ser Asn Leu Ser Thr Lys Met Ser 
325 330 335 

Thr Leu Ser Tyr Arg Pro Ser Asp Asn Val Ser Ser Ser Thr Lys Lys 
340 345 350 

5 Pro Ala Pro Cys Phe Glu Val Glu 

355 360 

(108) INFORMATION FOR SEQ ID NO: 107: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 26 base pairs 
10 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

]0 (iv) ANTI-SENSE: NO 

W 
'SI 

^f^ 15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107: 

iU 

Ifl CCCAAGCTTC CCCAGGTGTA TTTGAT 2 6 

SI : 

(109) INFORMATION FOR SEQ ID NO: 108: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 3 8 base pairs 
20 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI -SENSE: YES 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108: 

AAGCACAATT GCTGCATAAT TATCTTAAAA ATATCATC 3 8 

(110) INFORMATION FOR SEQ ID NO: 109: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 3 9 base pairs 
30 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(iv) ANTI -SENSE: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109: 
AAGATAATTA TGGCAGCAAT TGTGCTTTTC TTTTTCTTT 3 9 

(111) INFORMATION FOR SEQ ID NO: 110: 

(i) SEQXJENCE CHARACTERISTICS: 
5 (A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
10 (iv> ANTI-SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110: 
GTTGGATCCA CATAATGCAT TTTCTC 26 

(112) INFORMATION FOR SEQ ID NO till: 

m 

py (i) SEQUENCE CHARACTERISTICS: 

I ft 15 (A) LENGTH: 1344 base pairs 

fy (B) TYPE: nucleic acid 

r (C) STRANDEDNESS: single 

13 (D) TOPOLOGY: linear 

Ik (ii) MOLECULE TYPE: DNA (genomic) 



1^ 



20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111: 

ATGGAGCTGC TAAAGCTGAA CCGGAGCGTG CAGGGAACCG GACCCGGGCC GGGGGCTTCC 60 

CTGTGCCGCC CGGGGGCGCC TCTCCTCAAC AGCAGCAGTG TGGGCAACCT CAGCTGCGAG 120 

CCCCCTCGCA TTCGCGGAGC CGGGACACGA GAATTGGAGC TGGCCATTAG AATCACTCTT 180 

TACGCAGTGA TCTTCCTGAT GAGCGTTGGA GGAAATATGC TCATCATCGT GGTCCTGGGA 24 0 

25 CTGAGCCGCC GCCTGAGGAC TGTCACCAAT GCCTTCCTCC TCTCACTGGC AGTCAGCGAC 3 00 

CTCCTGCTGG CTGTGGCTTG CATGCCCTTC ACCCTCCTGC CCAATCTCAT GGGCACATTC 360 

ATCTTTGGCA CCGTCATCTG CAAGGCGGTT TCCTACCTCA TGGGGGTGTC TGTGAGTGTG 420 

TCCACGCTAA GCCTCGTGGC CATCGCACTG GAGCGATATA GCGCCATCTG CCGACCACTG 4 80 

CAGGCACGAG TGTGGCAGAC GCGCTCCCAC GCGGCTCGCG TGATTGTAGC CACGTGGCTG 540 

30 CTGTCCGGAC TACTCATGGT GCCCTACCCC GTGTACACTG TCGTGCAACC AGTGGGGCCT 600 

CGTGTGCTGC AGTGCGTGCA TCGCTGGCCC AGTGCGCGGG TCCGCCAGAC CTGGTCCGTA 660 
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CTGCTGCTTC TGCTCTTGTT CTTCATCCCA GGTGTGGTTA TGGCCGTGGC CTACGGGCTT 7 20 

ATCTCTCGCG AGCTCTACTT AGGGCTTCGC TTTGACGGCG ACAGTGACAG CGACAGCCAA 7 80 

AGCAGGGTCC GAAACCAAGG CGGGCTGCCA GGGGCTGTTC ACCAGAACGG GCGTTGCCGG 84 0 

CCTGAGACTG GCGCGGTTGG CAAAGACAGC GATGGCTGCT ACGTGCAACT TCCACGTTCC 900 

5 CGGCCTGCCC TGGAGCTGAC GGCGCTGACG GCTCCTGGGC CGGGATCCGG CTCCCGGCCC 960 

ACCCAGGCCA AGCTGCTGGC TAAGAAGCGC GTGAAACGAA TGTTGCTGGT GATCGTTGTG 1020 

CTTTTTTTTC TGTGTTGGTT GCCAGTTTAT AGTGCCAACA CGTGGCGCGC CTTTGATGGC 10 80 

CCGGGTGCAC ACCGAGCACT CTCGGGTGCT CCTATCTCCT TCATTCACTT GCTGAGCTAC 114 0 

GCCTCGGCCT GTGTCAACCC CCTGGTCTAC TGCTTCATGC ACCGTCGCTT TCGCCAGGCC 12 00 

10 TGCCTGGAAA CTTGCGCTCG CTGCTGCCCC CGGCCTCCAC GAGCTCGCCC CAGGGCTCTT 12 60 

CCCGATGAGG ACCCTCCCAC TCCCTCCATT GCTTCGCTGT CCAGGCTTAG CTACACCACC 1320 

ATCAGCACAC TGGGCCCTGG CTGA 1344 

(113) INFORMATION FOR SEQ ID NO:112: 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 447 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112: 

Met Glu Leu Leu Lys Leu Asn Arg Ser Val Gin Gly Thr Gly Pro Gly 
1 5 10 15 

Pro Gly Ala Ser Leu Cys Arg Pro Gly Ala Pro Leu Leu Asn Ser Ser 
20 25 30 

25 Ser Val Gly Asn Leu Ser Cys Glu Pro Pro Arg lie Arg Gly Ala Gly 

35 40 45 

Thr Arg Glu Leu Glu Leu Ala lie Arg lie Thr Leu Tyr Ala Val lie 
50 55 60 

Phe Leu Met Ser Val Gly Gly Asn Met Leu lie lie Val Val Leu Gly 
30 65 70 75 80 

Leu Ser Arg Arg Leu Arg Thr Val Thr Asn Ala Phe Leu Leu Ser Leu 
85 90 95 
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Ala Val Ser Asp Leu Leu Leu Ala Val Ala Cys Met Pro Phe Thr Leu 
^00 105 110 

Leu Pro Asn Leu Met Gly Thr Phe He Phe Gly Thr Val He Cys Lys 
115 120 125 

5 Ala Val Ser Tyr Leu Met Gly Val Ser Val Ser Val Ser Thr Leu Ser 

130 135 140 

Leu Val Ala He Ala Leu Glu Arg Tyr Ser Ala He Cys Arg Pro Leu 

150 155 160 

Gin Ala Arg Val Trp Gin Thr Arg Ser His Ala Ala Arg Val He Val 
1^ 165 170 175 

Ala Thr Trp Leu Leu Ser Gly Leu Leu Met Val Pro Tyr Pro Val Tyr 
180 185 190 

Thr Val Val Gin Pro Val Gly Pro Arg Val Leu Gin Cys Val His Arg 
195 200 205 

15 Trp Pro Ser Ala Arg Val Arg Gin Thr Trp Ser Val Leu Leu Leu Leu^ 

210 215 220 

III ^€su Leu Phe Phe He Pro Gly Val Val Met Ala Val Ala Tyr Gly Leu 

225 230 235 240 

Q He Ser Arg Glu Leu Tyr Leu Gly Leu Arg Phe Asp Gly Asp Ser Asp 

20 245 250 255 

H in? 

'H Ser Asp Ser Gin Ser Arg Val Arg Asn Gin Gly Gly Leu Pro Gly Ala 

260 265 270 



III 



25 



Val His Gin Asn Gly Arg Cys Arg Pro Glu Thr Gly Ala Val Gly Lys 
275 280 285 

Asp Ser Asp Gly Cys Tyr Val Gin Leu Pro Arg Ser Arg Pro Ala Leu 
290 295 300 

Glu Leu Thr Ala Leu Thr Ala Pro Gly Pro Gly Ser Gly Ser Arg Pro 
305 310 315 320 

Thr Gin Ala Lys Leu Leu Ala Lys Lys Arg Val Lys Arg Met Leu Leu 
30 325 330 335 

Val He Val Val Leu Phe Phe Leu Cys Trp Leu Pro Val Tyr Ser Ala 
340 345 350 

Asn Thr Trp Arg Ala Phe Asp Gly Pro Gly Ala His Arg Ala Leu Ser 
355 360 365 



35 



Val Ala Pro He Ser Phe He His Leu Leu Ser Tyr Ala Ser Ala Cys 
370 375 380 



Val Asn Pro Leu Val Tyr Cys Phe Met His Arg Arg Phe Arg Gin Ala 
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385 390 395 400 

Cys Leu Glu Thr Cys Ala Arg Cys Cys Pro Arg Pro Pro Arg Ala Arg 
405 410 415 

Pro Arg Ala Leu Pro Asp Glu Asp Pro Pro Thr Pro Ser He Ala Ser 
^ 420 425 430 

Leu Ser Arg Leu Ser Tyr Thr Thr He Ser Thr Leu Gly Pro Gly 
435 440 445 

(114) INFORMATION FOR SEQ ID NO: 113: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID N0:113: 

CAGCAGCATG CGCTTCACGC GCTTCTTAGC CCAG 34 
' (115) INFORMATION FOR SEQ ID NO:114: 

o 

(i) SEQUENCE CHARACTERISTICS: 
P (A) LENGTH: 33 base pairs 

Si 20 (B) TYPE: nucleic acid 

13 (C) STRANDEDNESS: single 

1^ (D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114: 

25 AGAAGCGCGT GAAGCGCATG CTGCTGGTGA TCGTT 35 

(116) INFORMATION FOR SEQ ID NO:115: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 
30 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI -SENSE: NO 



^0 

IE 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:115: 
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ATGGAGAAAA GAATCAAAAG AATGTTCTAT ATA 
(117) INFORMATION FOR SEQ ID NO: 116: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI- SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116 
TATATAGAAC ATTCTTTTGA TTCTTTTCTC CAT 
(118) INFORMATION FOR SEQ ID NO: 117: 

SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI -SENSE: NO 



20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117: 

CGCTCTCTGG CCTTGAAGCG CACGCTCAGC 
(119) INFORMATION FOR SEQ ID N0:118: 

SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
{ iv) ANTI -SENSE : YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118: 
GCTGAGCGTG CGCTTCAAGG CCAGAGAGCG 
(120) INFORMATION FOR SEQ ID NO: 119: 



(i) 



15 



(i) 



25 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
5 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iv) ANTI- SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119 
CCCAGGAAAA AGGTGAAAGT CAAAGTTTTC 
10 (121) INFORMATION FOR SEQ ID NO: 120: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
15 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iv) ANTI -SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 0 
GAAAACTTTG ACTTTCACCT TTTTCCTGGG 
20 (122) INFORMATION FOR SEQ ID NO: 121: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : s ingle 
25 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic) 

(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121 
GGGGCGCGGG TGAAACGGCT GGTGAGC 
30 (123) INFORMATION FOR SEQ ID NO: 122: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI-SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122 
5 GCTCACCAGC CGTTTCACCC GCGCCCC 

(124) INFORMATION FOR SEQ ID NO:123: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 
10 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI- SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 3 
15 CCCCTTGAAA AGCCTAAGAA CTTGGTCATC 

(125) INFORMATION FOR SEQ ID NO:124: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 
20 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI -SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124 
25 GATGACCAAG TTCTTAGGCT TTTCAAGGGG 

(126) INFORMATION FOR SEQ ID NO: 125: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 
30 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 
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(iv) ANTI-SENSE: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:125: 
GATCTCTAGA ATGAACAGCA CATGTATTGA AG 32 

(127) INFORMATION FOR SEQ ID NO: 126: 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

10 (ii) MOLECULE TYPE: DNA (genomic) 

(iv) ANTI -SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 126: 
CTAGGGTACC CGCTCAAGGA CCTCTAATTC CATAG 35 

(12 8) INFORMATION FOR SEQ ID NO: 127: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1296 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

20 (ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:127: 

ATGCAGGCGC TTAACATTAC CCCGGAGCAG TTCTCTCGGC TGCTGCGGGA CCACAACCTG 60 

ACGCGGGAGC AGTTCATCGC TCTGTACCGG CTGCGACCGC TCGTCTACAC CCCAGAGCTG 12 0 

CCGGGACGCG CCAAGCTGGC CCTCGTGCTC ACCGGCGTGC TCATCTTCGC CCTGGCGCTC 180 

25 TTTGGCAATG CTCTGGTGTT CTACGTGGTG ACCCGCAGCA AGGCCATGCG CACCGTCACC 24 0 

AACATCTTTA TCTGCTCCTT GGCGCTCAGT GACCTGCTCA TCACCTTCTT CTGCATTCCC 3 00 

GTCACCATGC TCCAGAACAT TTCCGACAAC TGGCTGGGGG GTGCTTTCAT TTGCAAGATG 360 

GTGCCATTTG TCCAGTCTAC CGCTGTTGTG ACAGAAATGC TCACTATGAC CTGCATTGCT 420 

GTGGAAAGGC ACCAGGGACT TGTGCATCCT TTTAAAATGA AGTGGCAATA CACCAACCGA 48 0 
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AGGGCTTTCA CAATGCTAGG TGTGGTCTGG CTGGTGGCAG TCATCGTAGG ATCACCCATG 54 0 

TGGCACGTGC AACAACTTGA GATCAAATAT GACTTCCTAT ATGAAAAGGA ACACATCTGC 600 

TGCTTAGAAG AGTGGACCAG CCCTGTGCAC CAGAAGATCT ACACCACCTT CATCCTTGTC 660 

ATCCTCTTCC TCCTGCCTCT TATGGTGATG CTTATTCTGT ACAGTAAAAT TGGTTATGAA 720 

5 CTTTGGATAA AGAAAAGAGT TGGGGATGGT TCAGTGCTTC GAACTATTCA TGGAAAAGAA 7 80 

ATGTCCAAAA TAGCCAGGAA GAAGAAACGA GCTAAGATTA TGATGGTGAC AGTGGTGGCT 84 0 

CTCTTTGCTG TGTGCTGGGC ACCATTCCAT GTTGTCCATA TGATGATTGA ATACAGTAAT 900 

TTTGAAAAGG AATATGATGA TGTCACAATC AAGATGATTT TTGCTATCGT GCAAATTATT 960 

GGATTTTCCA ACTCCATCTG TAATCCCATT GTCTATGCAT TTATGAATGA AAACTTCAAA 102 0 

10 AAAAATGTTT TGTCTGCAGT TTGTTATTGC ATAGTAAATA AAACCTTCTC TCCAGCACAA 1080 

AGGCATGGAA ATTCAGGAAT TACAATGATG CGGAAGAAAG CAAAGTTTTC CCTCAGAGAG 114 0 

AATCCAGTGG AGGAAACCAA AGGAGAAGCA TTCAGTGATG GCAACATTGA AGTCAAATTG 1200 

TGTGAACAGA CAGAGGAGAA GAAAAAGCTC AAACGACATC TTGCTCTCTT TAGGTCTGAA 1260 

CTGGCTGAGA ATTCTCCTTT AGACAGTGGG CATTAA 1296 

15 (129) INFORMATION FOR SEQ ID NO: 128: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 431 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

20 (D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 8: 

Met Gin Ala Leu Asn He Thr Pro Glu Gin Phe Ser Arg Leu Leu Arg 
15 10 15 

25 Asp His Asn Leu Thr Arg Glu Gin Phe He Ala Leu Tyr Arg Leu Arg 

20 25 30 

Pro Leu Val Tyr Thr Pro Glu Leu Pro Gly Arg Ala Lys Leu Ala Leu 
35 40 45 

Val Leu Thr Gly Val Leu He Phe Ala Leu Ala Leu Phe Gly Asn Ala 
30 50 55 60 

Leu Val Phe Tyr Val Val Thr Arg Ser Lys Ala Met Arg Thr Val Thr 
65 70 75 80 



IB 



m 
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Asn lie Phe He Cys Ser Leu Ala Leu Ser Asp Leu Leu He Thr Phe 
85 90 95 

Phe Cys He Pro Val Thr Met Leu Gin Asn He Ser Asp Asn Trp Leu 
100 105 110 

5 Gly Gly Ala Phe He Cys Lys Met Val Pro Phe Val Gin Ser Thr Ala 

115 120 125 

Val Val Thr Glu Met Leu Thr Met Thr Cys He Ala Val Glu Arg His 
130 135 140 

Gin Gly Leu Val His Pro Phe Lys Met Lys Trp Gin Tyr Thr Asn Arg 
10 145 150 155 160 

Arg Ala Phe Thr Met Leu Gly Val Val Trp Leu Val Ala Val He Val 
165 170 175 

Gly Ser Pro Met Trp His Val Gin Gin Leu Glu He Lys Tyr Asp Phe 
180 185 190 

15 Leu Tyr Glu Lys Glu His He Cys Cys Leu Glu Glu Trp Thr Ser Pro 

195 200 205 

Val His Gin Lys He Tyr Thr Thr Phe He Leu Val He Leu Phe Leu 
210 215 220 

Leu Pro Leu Met Val Met Leu He Leu Tyr Ser Lys He Gly Tyr Glu 
20 225 230 235 240 

Leu Trp He Lys Lys Arg Val Gly Asp Gly Ser Val Leu Arg Thr He 
245 250 255 

His Gly Lys Glu Met Ser Lys He Ala Arg Lys Lys Lys Arg Ala Lys 
260 265 270 

25 He Met Met Val Thr Val Val Ala Leu Phe Ala Val Cys Trp Ala Pro 

275 280 285 

Phe His Val Val His Met Met He Glu Tyr Ser Asn Phe Glu Lys Glu 
290 295 300 

Tyr Asp Asp Val Thr He Lys Met He Phe Ala He Val Gin He He 
30 305 310 315 320 

Gly Phe Ser Asn Ser He Cys Asn Pro He Val Tyr Ala Phe Met Asn 
325 330 335 

Glu Asn Phe Lys Lys Asn Val Leu Ser Ala Val Cys Tyr Cys He Val 
340 345 350 

35 Asn Lys Thr Phe Ser Pro Ala Gin Arg His Gly Asn Ser Gly He Thr 

355 360 365 



Met Met Arg Lys Lys Ala Lys Phe Ser Leu Arg Glu Asn Pro Val Glu 
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370 375 380 

Glu Thr Lys Gly Glu Ala Phe Ser Asp Gly Asn lie Glu Val Lys Leu 
385 390 395 400 

Cys Glu Gin Thr Glu Glu Lys Lys Lys Leu Lys Arg His Leu Ala Leu 
5 405 410 415 

Phe Arg Ser Glu Leu Ala Glu Asn Ser Pro Leu Asp Ser Gly His 
420 425 430 

(130) INFORMATION FOR SEQ ID NO: 129: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 2 040 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

'J (ii) MOLECULE TYPE: DNA (genomic) 

IS 

2 15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 9: 

ATGGGCAGCC CCTGGAACGG CAGCGACGGC CCCGAGGGGG CGCGGGAGCC GCCGTGGCCC 

ill 60 

m 

s_ GCGCTGCCGC CTTGCGACGA GCGCCGCTGC TCGCCCTTTC CCCTGGGGGC GCTGGTGCCG 

P 120 

m 

13 20 GTGACCGCTG TGTGCCTGTG CCTGTTCGTC GTCGGGGTGA GCGGCAACGT GGTGACCGTG 

'4 180 

ATGCTGATCG GGCGCTACCG GGACATGCGG ACCACCACCA ACTTGTACCT GGGCAGCATG 
240 

GCCGTGTCCG ACCTACTCAT CCTGCTCGGG CTGCCGTTCG ACCTGTACCG CCTCTGGCGC 
25 300 

TCGCGGCCCT GGGTGTTCGG GCCGCTGCTC TGCCGCCTGT CCCTCTACGT GGGCGAGGGC 
360 

TGCACCTACG CCACGCTGCT GCACATGACC GCGCTCAGCG TCGAGCGCTA CCTGGCCATC 
30 420 

TGCCGCCCGC TCCGCGCCCG CGTCTTGGTC ACCCGGCGCC GCGTCCGCGC GCTCATCGCT 
480 

GTGCTCTGGG CCGTGGCGCT GCTCTCTGCC GGTCCCTTCT TGTTCCTGGT GGGCGTCGAG 
35 540 

CAGGACCCCG GCATCTCCGT AGTCCCGGGC CTCAATGGCA CCGCGCGGAT CGCCTCCTCG 
600 



40 CCTCTCGCCT CGTCGCCGCC TCTCTGGCTC TCGCGGGCGC CACCGCCGTC CCCGCCGTCG 
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660 

GGGCCCGAGA CCGCGGAGGC 
720 

5 

CAGCTGGGCG CGCTGCGTGT 
780 

CTGTGCCTCA GCATCCTCTA 
10 840 

CTGCGAGGCC CGGCCGCCTC 
900 

15 CGTAAGTGGA GCCGCCGTGG 
960 

GCGCAAACGC TGGGTCCCCT 
1020 

20 

TTTCCTATTT CGATTCCAGC 
1080 

TCCTGTCCCC CAGGAGCTCT 
25 1140 

CGATTCAGTA ACCAGCAGTG 
1200 

30 TTCTTAATCC AACCACCTGT 
1260 

AGACGAGGGA GATTTCATTA 
1320 

35 

TAAAGTAAAC CTTGCTCGTA 
1380 

TTCAACAGAG AACAGAAAAC 
40 144 0 

CGGCTTGTTC AGAGAAATTG 
1500 

45 AGCCTACTAT GCAGTTTTAA 
1560 

GGGGTGAGGA TCTGCCTAGG 
1620 

50 

GCAGATGGTT CCTTGTCGGG 
1680 

GGTGCTGTGT CTTATGTTGC 
55 1740 



-157- 

CGCGGCGCTG TTCAGCCGCG 
CATGCTGTGG GTCACCACCG 
CGGGCTCATC GGGCGGGAGC 
GGGGCGGGAG AGAGGCCACC 
TTCCAAAGAC GCCTGCCTGC 
TCCCCTGCTC GCCCAGCTCT 
CTCCACCCGC CGGTACTTCC 
GGGGGACCCC AGGGCGCTTT 
CTTTTCCAGA GCCTCTGAGA 
TAGATGCCAC AAATGAGGAG 
AGCTAAAATT TTTTATTTAA 
TCAAAAAGTA AAGATTGTGC 
TTGTCTCCGA AGTGGGTTTG 
CTCCTTCTGG TTTATGTCCA 
AGCAAGTATC CATGCAGCCT 
TAGAAGTTTT CTCTAATTTA 
GTGGGGGGTT TATTTGCTTC 
AGTGGTGGTG GTTCTGGCAT 



PATENT 

AATGCCGGCC GAGCCCCGCG 
CCTACTTCTT CCTGCCCTTT 
TGTGGAGCAG CCGGCGGCCG 
GGCAGACCAA ACGCGTCCTG 
AGTCCGCCCC GCCGGGGACC 
GGGCGCCGCT TCCAGCTCCC 
CATCCCCCGA GAAAACCATG 
GAGGGTGGGA TCCCCGGATC 
CCAGAAAGGA GAGTTGGTAA 
TCCTCACAGT GCTCTTGAGA 
TGTTAAGTGA TGCTGAAGGC 
AGACCTGTTG TAGAATTCTT 
TGGAAGGAAG CCTGCCAAGG 
GCCTTGATAA CACATATGGG 
GCAGCCTGGT CATTTTTTCT 
TTTTGCTGTT ACTTGTTATT 
CCAATGCTTT TGTTAATCCC 
TTATAATTTG CTGGTTGCCC 
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TTCCACGTTG GCAGAATCAT TTACATAAAC ACGGAAGATT CGCGGATGAT GTACTTCTCT 
1800 

5 CAGTACTTTA ACATCGTCGC TCTGCAACTT TTCTATCTGA GCGCATCTAT CAACCCT^TC 
1860 

CTCTACAACC TCATTTCAAA GAAGTACAGA GCGGCGGCCT TTAAACTGCT GCTCGCAAGG 
1920 

10 

AAGTCCAGGC CGAGAGGCTT CCACAGAAGC AGGGACACTG CGGGGGAAGT TGCAGGGGAC 
1980 

ACTGGAGGAG ACACGGTGGG CTACACCGAG ACAAGCGCTA ACGTGAAGAC GATGGGATAA 
15 2040 

(131) INFORMATION FOR SEQ ID NO: 13 0: 

(i) SEQUENCE CHARACTERISTICS: 
^3 (A) LENGTH: 412 amino acids 

(B) TYPE: amino acid 
20 (C) STRANDEDNESS: 

(D) TOPOLOGY: not relevant 

iU (ii) MOLECULE TYPE: protein 

in 
iU 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 130: 

ip Met Gly Ser Pro Trp Asn Gly Ser Asp Gly Pro Glu Gly Ala Arg Glu 

25 1 5 10 15 



Pro Pro Trp Pro Ala Leu Pro Pro Cys Asp Glu Arg Arg Cys Ser Pro 
20 25 30 

Phe Pro Leu Gly Ala Leu Val Pro Val Thr Ala Val Cys Leu Cys Leu 
35 40 45 

30 Phe Val Val Gly Val Ser Gly Asn Val Val Thr Val Met Leu He Gly 

50 55 60 

Arg Tyr Arg Asp Met Arg Thr Thr Thr Asn Leu Tyr Leu Gly Ser Met 
65 70 75 80 

Ala Val Ser Asp Leu Leu He Leu Leu Gly Leu Pro Phe Asp Leu Tyr 
35 85 90 95 

Arg Leu Trp Arg Ser Arg Pro Trp Val Phe Gly Pro Leu Leu Cys Arg 
100 105 110 

Leu Ser Leu Tyr Val Gly Glu Gly Cys Thr Tyr Ala Thr Leu Leu His 
115 120 125 



40 



Met Thr Ala Leu Ser Val Glu Arg Tyr Leu Ala He Cys Arg Pro Leu 
130 135 140 



IP 

If J 
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Arg Ala Arg Val Leu Val Thr Arg Arg Arg Val Arg Ala Leu lie Ala 
145 150 155 160 

Val Leu Trp Ala Val Ala Leu Leu Ser Ala Gly Pro Phe Leu Phe Leu 
165 170 175 

5 Val Gly Val Glu Gin Asp Pro Gly He Ser Val Val Pro Gly Leu Asn 

180 185 190 

Gly Thr Ala Arg He Ala Ser Ser Pro Leu Ala Ser Ser Pro Pro Leu 
195 200 205 

Trp Leu Ser Arg Ala Pro Pro Pro Ser Pro Pro Ser Gly Pro Glu Thr 
10 210 215 220 

Ala Glu Ala Ala Ala Leu Phe Ser Arg Glu Cys Arg Pro Ser Pro Ala 
225 230 235 240 

Gin Leu Gly Ala Leu Arg Val Met Leu Trp Val Thr Thr Ala Tyr Phe 
245 250 255 

15 Phe Leu Pro Phe Leu Cys Leu Ser He Leu Tyr Gly Leu He Gly Arg 

260 265 270 

Glu Leu Trp Ser Ser Arg Arg Pro Leu Arg Gly Pro Ala Ala Ser Gly 
275 280 285 

Arg Glu Arg Gly His Arg Gin Thr Lys Arg Val Leu Leu Val Val Val 
P 20 290 295 300 

13 Leu Ala Phe He He Cys Trp Leu Pro Phe His Val Gly Arg He He 

H 305 310 315 320 

Tyr He Asn Thr Glu Asp Ser Arg Met Met Tyr Phe Ser Gin Tyr Phe 
325 330 335 

25 Asn He Val Ala Leu Gin Leu Phe Tyr Leu Ser Ala Ser He Asn Pro 

340 345 350 

He Leu Tyr Asn Leu He Ser Lys Lys Tyr Arg Ala Ala Ala Phe Lys 
355 360 365 

Leu Leu Leu Ala Arg Lys Ser Arg Pro Arg Gly Phe His Arg Ser Arg 
30 370 375 380 

Asp Thr Ala Gly Glu Val Ala Gly Asp Thr Gly Gly Asp Thr Val Gly 
385 390 395 400 

Tyr Thr Glu Thr Ser Ala Asn Val Lys Thr Met Gly 
405 410 

35 (132) INFORMATION FOR SEQ ID NO: 131: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1344 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

5 (xi) SEQXJENCE DESCRIPTION: SEQ ID NO: 131: 

ATGGAGCTGC TAAAGCTGAA CCGGAGCGTG CAGGGAACCG GACCCGGGCC GGGGGCTTCC 
60 

CTGTGCCGCC CGGGGGCGCC TCTCCTCAAC AGCAGCAGTG TGGGCAACCT CAGCTGCGAG 
120 

10 CCCCCTCGCA TTCGCGGAGC CGGGACACGA GAATTGGAGC TGGCCATTAG AATCACTCTT 
180 

TACGCAGTGA TCTTCCTGAT GAGCGTTGGA GGAAATATGC TCATCATCGT GGTCCTGGGA 
240 

CTGAGCCGCC GCCTGAGGAC TGTCACCAAT GCCTTCCTCC TCTCACTGGC AGTCAGCGAC 
15 300 

CTCCTGCTGG CTGTGGCTTG CATGCCCTTC ACCCTCCTGC CCAATCTCAT GGGCACATTC 
360 

ATCTTTGGCA CCGTCATCTG CAAGGCGGTT TCCTACCTCA TGGGGGTGTC TGTGAGTGTG 
420 

20 TCCACGCTAA GCCTCGTGGC CATCGCACTG GAGCGATATA GCGCCATCTG CCGACCACTG 
480 

CAGGCACGAG TGTGGCAGAC GCGCTCCCAC GCGGCTCGCG TGATTGTAGC CACGTGGCTG 
540 

CTGTCCGGAC TACTCATGGT GCCCTACCCC GTGTACACTG TCGTGCAACC AGTGGGGCCT 
25 600 

CGTGTGCTGC AGTGCGTGCA TCGCTGGCCC AGTGCGCGGG TCCGCCAGAC CTGGTCCGTA 
660 

CTGCTGCTTC TGCTCTTGTT CTTCATCCCA GGTGTGGTTA TGGCCGTGGC CTACGGGCTT 
720 

30 ATCTCTCGCG AGCTCTACTT AGGGCTTCGC TTTGACGGCG ACAGTGACAG CGACAGCCAA 
780 

AGCAGGGTCC GAAACCAAGG CGGGCTGCCA GGGGCTGTTC ACCAGAACGG GCGTTGCCGG 
840 

CCTGAGACTG GCGCGGTTGG CAAAGACAGC GATGGCTGCT ACGTGCAACT TCCACGTTCC 
35 900 

CGGCCTGCCC TGGAGCTGAC GGCGCTGACG GCTCCTGGGC CGGGATCCGG CTCCCGGCCC 
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960 

ACCCAGGCCA AGCTGCTGGC TAAGAAGCGC GTGAAACGAA TGTTGCTGGT GATCGTTGTG 
1020 

CTTTTTTTTC TGTGTTGGTT GCCAGTTTAT AGTGCCAACA CGTGGCGCGC CTTTGATGGC 
5 1080 

CCGGGTGCAC ACCGAGCACT CTCGGGTGCT CCTATCTCCT TCATTCACTT GCTGAGCTAC 
1140 

GCCTCGGCCT GTGTCAACCC CCTGGTCTAC TGCTTCATGC ACCGTCGCTT TCGCCAGGCC 
1200 

10 TGCCTGGAAA CTTGCGCTCG CTGCTGCCCC CGGCCTCCAC GAGCTCGCCC CAGGGCTCTT 
1260 

CCCGATGAGG ACCCTCCCAC TCCCTCCATT GCTTCGCTGT CCAGGCTTAG CTACACCACC 
O 1320 

IQ ATCAGCACAC TGGGCCCTGG CTGA 

M 15 1344 



ly 



5f 5 



(133) INFORMATION FOR SEQ ID NO:132: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 447 amino acids 

(B) TYPE: amino acid 
20 (C) STRANDEDNESS : 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:132: 

Met Glu Leu Leu Lys Leu Asn Arg Ser Val Gin Gly Thr Gly Pro Gly 
25 1 5 10 15 

Pro Gly Ala Ser Leu Cys Arg Pro Gly Ala Pro Leu Leu Asn Ser Ser 
20 25 30 

Ser Val Gly Asn Leu Ser Cys Glu Pro Pro Arg lie Arg Gly Ala Gly 
35 40 45 

30 Thr Arg Glu Leu Glu Leu Ala lie Arg lie Thr Leu Tyr Ala Val lie 

50 55 60 

Phe Leu Met Ser Val Gly Gly Asn Met Leu lie lie Val Val Leu Gly 
65 70 75 80 

Leu Ser Arg Arg Leu Arg Thr Val Thr Asn Ala Phe Leu Leu Ser Leu 
35 85 90 95 



Ala Val Ser Asp Leu Leu Leu Ala Val Ala Cys Met Pro Phe Thr Leu 



m 
m 

if? 

iu 
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100 105 110 

Leu Pro Asn Leu Met Gly Thr Phe lie Phe Gly Thr Val lie Cys Lys 
115 120 125 

Ala Val Ser Tyr Leu Met Gly Val Ser Val Ser Val Ser Thr Leu Ser 
5 130 135 140 

Leu Val Ala lie Ala Leu Glu Arg Tyr Ser Ala lie Cys Arg Pro Leu 
145 150 155 160 

Gin Ala Arg Val Trp Gin Thr Arg Ser His Ala Ala Arg Val lie Val 
165 170 175 

10 Ala Thr Trp Leu Leu Ser Gly Leu Leu Met Val Pro Tyr Pro Val Tyr 

180 185 190 

Thr Val Val Gin Pro Val Gly Pro Arg Val Leu Gin Cys Val His Arg 
195 200 205 



ig Trp Pro Ser Ala Arg Val Arg Gin Thr Trp Ser Val Leu Leu Leu Leu 



15 210 215 220 

Leu Leu Phe Phe lie Pro Gly Val Val Met Ala Val Ala Tyr Gly Leu 
225 230 235 240 

lie Ser Arg Glu Leu Tyr Leu Gly Leu Arg Phe Asp Gly Asp Ser Asp 
]^ 245 250 255 

20 Ser Asp Ser Gin Ser Arg Val Arg Asn Gin Gly Gly Leu Pro Gly Ala 

^^ 260 265 270 

13 Val His Gin Asn Gly Arg Cys Arg Pro Glu Thr Gly Ala Val Gly Lys 

1^^ 275 280 285 

Asp Ser Asp Gly Cys Tyr Val Gin Leu Pro Arg Ser Arg Pro Ala Leu 
25 290 295 300 

Glu Leu Thr Ala Leu Thr Ala Pro Gly Pro Gly Ser Gly Ser Arg Pro 
305 310 315 320 

Thr Gin Ala Lys Leu Leu Ala Lys Lys Arg Val Lys Arg Met Leu Leu 
325 330 335 

30 Val lie Val Val Leu Phe Phe Leu Cys Trp Leu Pro Val Tyr Ser Ala 

340 345 350 

Asn Thr Trp Arg Ala Phe Asp Gly Pro Gly Ala His Arg Ala Leu Ser 
355 360 365 

Val Ala Pro lie Ser Phe lie His Leu Leu Ser Tyr Ala Ser Ala Cys 
35 370 375 380 

Val Asn Pro Leu Val Tyr Cys Phe Met His Arg Arg Phe Arg Gin Ala 
385 390 395 400 
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Cys Leu Glu Thr Cys Ala Arg Cys 
405 

Pro Arg Ala Leu Pro Asp Glu Asp 
420 

5 Leu Ser Arg Leu Ser Tyr Thr Thr 

435 440 

(134) INFORMATION FOR SEQ ID NO: 133: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1014 base pairs 
10 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:133: 

15 ATGAACAGCA CATGTATTGA AGAACAGCAT GACCTGGATC ACTATTTGTT TCCCATTGTT 60 

TACATCTTTG TGATTATAGT CAGCATTCCA GCCAATATTG GATCTCTGTG TGTGTCTTTC 120 

CTGCAAGCAA AGAAGGAAAG TGAACTAGGA ATTTACCTCT TCAGTTTGTC ACTATCAGAT 180 

TTACTCTATG CATTAACTCT CCCTTTATGG ATTGATTATA CTTGGAATAA AGACAACTGG 240 

ACTTTCTCTC CTGCCTTGTG CAAAGGGAGT GCTTTTCTCA TGTACATGAA TTTTTACAGC 3 00 

20 AGCACAGCAT TCCTCACCTG CATTGCCGTT GATCGGTATT TGGCTGTTGT CTACCCTTTG 360 

AAGTTTTTTT TCCTAAGGAC AAGAAGATTT GCACTCATGG TCAGCCTGTC CATCTGGATA 42 0 

TTGGAAACCA TCTTCAATGC TGTCATGTTG TGGGAAGATG AAACAGTTGT TGAATATTGC 4 80 

GATGCCGAAA AGTCTAATTT TACTTTATGC TATGACAAAT ACCCTTTAGA GAAATGGCAA 54 0 

ATCAACCTCA ACTTGTTCAG GACGTGTACA GGCTATGCAA TACCTTTGGT CACCATCCTG 600 

25 ATCTGTAACC GGAAAGTCTA CCAAGCTGTG CGGCACAATA AAGCCACGGA AAACAAGGAA 660 

AAGAAGAGAA TCAAAAAACT ACTTGTCAGC ATCACAGTTA CTTTTGTCTT ATGCTTTACT 72 0 

CCCTTTCATG TGATGTTGCT GATTCGCTGC ATTTTAGAGC ATGCTGTGAA CTTCGAAGAC 780 

CACAGCAATT CTGGGAAGCG AACTTACACA ATGTATAGAA TCACGGTTGC ATTAACAAGT 840 

TTAAATTGTG TTGCTGATCC AATTCTGTAC TGTTTTGTTA CCGAAACAGG AAGATATGAT 900 

30 ATGTGGAATA TATTAAAATT CTGCACTGGG AGGTGTAATA CATCACAAAG ACAAAGAAAA 960 

CGCATACTTT CTGTGTCTAC AAAAGATACT ATGGAATTAG AGGTCCTTGA GTAG 1014 



Cys Pro Arg Pro 
410 

Pro Pro Thr Pro 
425 

lie Ser Thr Leu 



PATENT 

Pro Arg Ala Arg 
415 

Ser lie Ala Ser 
430 

Gly Pro Gly 
445 
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(135) INFORMATION FOR SEQ ID NO: 134: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 337 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 



(xi) SEQXJENCE DESCRIPTION: SEQ ID NO: 134: 

Met Asn Ser Thr Cys lie Glu Glu Gin His Asp Leu Asp His Tyr Leu 
10 1 5 10 15 

Phe Pro lie Val Tyr lie Phe Val lie lie Val Ser lie Pro Ala Asn 
20 25 30 

lie Gly Ser Leu Cys Val Ser Phe Leu Gin Ala Lys Lys Glu Ser Glu 
35 40 45 

15 Leu Gly lie Tyr Leu Phe Ser Leu Ser Leu Ser Asp Leu Leu Tyr Ala 

50 55 60 

Leu Thr Leu Pro Leu Trp lie Asp Tyr Thr Trp Asn Lys Asp Asn Trp 
65 70 75 80 

Thr Phe Ser Pro Ala Leu Cys Lys Gly Ser Ala Phe Leu Met Tyr Met 
20 85 90 95 

Asn Phe Tyr Ser Ser Thr Ala Phe Leu Thr Cys lie Ala Val Asp Arg 
100 105 110 

Tyr Leu Ala Val Val Tyr Pro Leu Lys Phe Phe Phe Leu Arg Thr Arg 
115 120 125 

25 Arg Phe Ala Leu Met Val Ser Leu Ser lie Trp lie Leu Glu Thr lie 

130 135 140 

Phe Asn Ala Val Met Leu Trp Glu Asp Glu Thr Val Val Glu Tyr Cys 
145 150 155 160 

Asp Ala Glu Lys Ser Asn Phe Thr Leu Cys Tyr Asp Lys Tyr Pro Leu 
30 165 170 175 

Glu Lys Trp Gin lie Asn Leu Asn Leu Phe Arg Thr Cys Thr Gly Tyr 
180 185 190 

Ala lie Pro Leu Val Thr lie Leu lie Cys Asn Arg Lys Val Tyr Gin 
195 200 205 



35 



Ala Val Arg His Asn Lys Ala Thr Glu Asn Lys Glu Lys Lys Arg lie 
210 215 220 
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Lys Lys Leu Leu Val Ser He Thr Val Thr Phe Val Leu Cys Phe Thr 
225 230 235 240 

Pro Phe His Val Met Leu Leu He Arg Cys He Leu Glu His Ala Val 
245 250 255 

5 Asn Phe Glu Asp His Ser Asn Ser Gly Lys Arg Thr Tyr Thr Met Tyr 

260 265 270 

Arg He Thr Val Ala Leu Thr Ser Leu Asn Cys Val Ala Asp Pro He 
275 280 285 

Leu Tyr Cys Phe Val Thr Glu Thr Gly Arg Tyr Asp Met Trp Asn He 
10 290 295 300 

Leu Lys Phe Cys Thr Gly Arg Cys Asn Thr Ser Gin Arg Gin Arg Lys 
305 310 315 320 

Arg He Leu Ser Val Ser Thr Lys Asp Thr Met Glu Leu Glu Val Leu 
325 330 335 

15 Glu 



(13 6) INFORMATION FOR SEQ ID NO: 135: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 999 base pairs 
20 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 135: 

25 ATGGTGAACT CCACCCACCG TGGGATGCAC ACTTCTCTGC ACCTCTGGAA CCGCAGCAGT 
60 

TACAGACTGC ACAGCAATGC CAGTGAGTCC CTTGGAAAAG GCTACTCTGA TGGAGGGTGC 
120 

TACGAGCAAC TTTTTGTCTC TCCTGAGGTG TTTGTGACTC TGGGTGTCAT CAGCTTGTTG 
30 180 

GAGAATATCT TAGTGATTGT GGCAATAGCC AAGAACAAGA ATCTGCATTC ACCCATGTAC 
240 

TTTTTCATCT GCAGCTTGGC TGTGGCTGAT ATGCTGGTGA GCGTTTCAAA TGGATCAGAA 
300 



35 ACCATTATCA TCACCCTATT AAACAGTACA GATACGGATG CACAGAGTTT CACAGTGAAT 
360 
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ATTGATAATG TCATTGACTC GGTGATCTGT AGCTCCTTGC TTGCATCCAT TTGCAGCCTG 
420 



CTTTCAATTG CAGTGGACAG GTACTTTACT ATCTTCTATG CTCTCCAGTA CCATAACATT 
480 



5 ATGACAGTTA AGCGGGTTGG GATCAGCATA AGTTGTATCT GGGCAGCTTG CACGGTTTCA 
540 

GGCATTTTGT TCATCATTTA CTCAGATAGT AGTGCTGTCA TCATCTGCCT CATCACCATG 
600 

TTCTTCACCA TGCTGGCTCT CATGGCTTCT CTCTATGTCC ACATGTTCCT GATGGCCAGG 
10 660 



CTTCACATTA AGAGGATTGC TGTCCTCCCC GGCACTGGTG CCATCCGCCA AGGTGCCAAT 
720 



Si 
If] 
m 



ATGAAGGGAA AAATTACCTT GACCATCCTG ATTGGCGTCT TTGTTGTCTG CTGGGCCCCA 
780 

15 TTCTTCCTCC ACTTAATATT CTACATCTCT TGTCCTCAGA ATCCATATTG TGTGTGCTTC 
840 

ATGTCTCACT TTAACTTGTA TCTCATACTG ATCATGTGTA ATTCAATCAT CGATCCTCTG 



3^! 900 

5 y 

g ATTTATGCAC TCCGGAGTCA AGAACTGAGG AAAACCTTCA AAGAGATCAT CTGTTGCTAT 

P 20 960 

i f\ 

i3 CCCCTGGGAG GCCTTTGTGA CTTGTCTAGC AGATATTAA 

\i 999 

1^ (13 7) INFORMATION FOR SEQ ID NO: 136: 



(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 3 32 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 



30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 6: 

Met Val Asn Ser Thr His Arg Gly Met His Thr Ser Leu His Leu Trp 
15 10 15 

Asn Arg Ser Ser Tyr Arg Leu His Ser Asn Ala Ser Glu Ser Leu Gly 
20 25 30 



35 



Lys Gly Tyr Ser Asp Gly Gly Cys Tyr Glu Gin Leu Phe Val Ser Pro 
35 40 45 
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Glu Val Phe Val Thr Leu Gly Val lie Ser Leu Leu Glu Asn lie Leu 
50 55 60 

Val lie Val Ala lie Ala Lys Asn Lys Asn Leu His Ser Pro Met Tyr 
65 70 75 80 

5 Phe Phe lie Cys Ser Leu Ala Val Ala Asp Met Leu Val Ser Val Ser 

85 90 95 

Asn Gly Ser Glu Thr lie lie lie Thr Leu Leu Asn Ser Thr Asp Thr 
100 105 110 

Asp Ala Gin Ser Phe Thr Val Asn lie Asp Asn Val lie Asp Ser Val 
10 115 120 125 

lie Cys Ser Ser Leu Leu Ala Ser lie Cys Ser Leu Leu Ser lie Ala 
130 135 140 

Val Asp Arg Tyr Phe Thr lie Phe Tyr Ala Leu Gin Tyr His Asn lie 
145 150 155 160 

15 Met Thr Val Lys Arg Val Gly lie Ser lie Ser Cys lie Trp Ala Ala 

165 170 175 



Cys Thr Val Ser Gly lie Leu Phe lie He Tyr Ser Asp Ser Ser Ala 
III 180 185 190 



Val He lie Cys Leu He Thr Met Phe Phe Thr Met Leu Ala Leu Met 
20 195 200 205 

iJi : 
3 *=?: 

Ala Ser Leu Tyr Val His Met Phe Leu Met Ala Arg Leu His He Lys 

'^1 210 215 220 

n 

1*4= Arg He Ala Val Leu Pro Gly Thr Gly Ala He Arg Gin Gly Ala Asn 

225 230 235 240 

25 Met Lys Gly Lys He Thr Leu Thr He Leu He Gly Val Phe Val Val 

245 250 255 

Cys Trp Ala Pro Phe Phe Leu His Leu He Phe Tyr He Ser Cys Pro 
260 265 270 

Gin Asn Pro Tyr Cys Val Cys Phe Met Ser His Phe Asn Leu Tyr Leu 
30 275 280 285 

He Leu He Met Cys Asn Ser He He Asp Pro Leu He Tyr Ala Leu 
290 295 300 

Arg Ser Gin Glu Leu Arg Lys Thr Phe Lys Glu He He Cys Cys Tyr 
305 310 315 320 

35 Pro Leu Gly Gly Leu Cys Asp Leu Ser Ser Arg Tyr 

325 330 



(13 8) INFORMATION FOR SEQ ID NO: 137: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137; 

GCCAATATGA AGGGAAAAAT TACCTTGACC ATC 
33 

10 (137) INFORMATION FOR SEQ ID NO: 138: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
15 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 138 

CTCCTTCGGT CCTCCTATCG TTGTCAGAAG T 
31 

20 (14 0) INFORMATION FOR SEQ ID NO: 13 9: 

(i) SEQtJENCE CHARACTERISTICS: 

(A) LENGTH: 1842 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
25 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 9: 

ATGGGGCCCA CCCTAGCGGT TCCCACCCCC TATGGCTGTA TTGGCTGTAA GCTACCCCAG 60 

CCAGAATACC CACCGGCTCT AATCATCTTT ATGTTCTGCG CGATGGTTAT CACCATCGTT 12 0 

30 GTAGACCTAA TCGGCAACTC CATGGTCATT TTGGCTGTGA CGAAGAACT^ GAAGCTCCGG 180 

AATTCTGGCA ACATCTTCGT GGTCAGTCTC TCTGTGGCCG ATATGCTGGT GGCCATCTAC 24 0 

CCATACCCTT TGATGCTGCA TGCCATGTCC ATTGGGGGCT GGGATCTGAG CCAGTTACAG 3 00 

TGCCAGATGG TCGGGTTCAT CACAGGGCTG AGTGTGGTCG GCTCCATCTT CAACATCGTG 360 
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GCAATCGCTA TCAACCGTTA CTGCTACATC TGCCACAGCC TCCAGTACGA ACGGATCTTC 42 0 

AGTGTGCGCA ATACCTGCAT CTACCTGGTC ATCACCTGGA TCATGACCGT CCTGGCTGTC 4 80 

CTGCCCAACA TGTACATTGG CACCATCGAG TACGATCCTC GCACCTACAC CTGCATCTTC 54 0 

AACTATCTGA ACAACCCTGT CTTCACTGTT ACCATCGTCT GCATCCACTT CGTCCTCCCT 600 

5 CTCCTCATCG TGGGTTTCTG CTACGTGAGG ATCTGGACCA AAGTGCTGGC GGCCCGTGAC 660 

CCTGCAGGGC AGAATCCTGA CAACCAACTT GCTGAGGTTC GCAATTTTCT AACCATGTTT 72 0 

GTGATCTTCC TCCTCTTTGC AGTGTGCTGG TGCCCTATCA ACGTGCTCAC TGTCTTGGTG 780 

GCTGTCAGTC CGAAGGAGAT GGCAGGCAAG ATCCCCAACT GGCTTTATCT TGCAGCCTAC 840 

TTCATAGCCT ACTTCAACAG CTGCCTCAAC GCTGTGATCT ACGGGCTCCT CAATGAGAAT 900 

10 TTCCGAAGAG AATACTGGAC CATCTTCCAT GCTATGCGGC ACCCTATCAT ATTCTTCCCT 960 

GGCCTCATCA GTGATATTCG TGAGATGCAG GAGGCCCGTA CCCTGGCCCG CGCCCGTGCC 1020 

CATGCTCGCG ACCAAGCTCG TGAACAAGAC CGTGCCCATG CCTGTCCTGC TGTGGAGGAA 1080 

ACCCCGATGA ATGTCCGGAA TGTTCCATTA CCTGGTGATG CTGCAGCTGG CCACCCCGAC 114 0 

CGTGCCTCTG GCCACCCTAA GCCCCATTCC AGATCCTCCT CTGCCTATCG CAAATCTGCC 12 00 

15 TCTACCCACC ACAAGTCTGT CTTTAGCCAC TCCAAGGCTG CCTCTGGTCA CCTCAAGCCT 1260 

GTCTCTGGCC ACTCCAAGCC TGCCTCTGGT CACCCCAAGT CTGCCACTGT CTACCCTAAG 132 0 

CGTGCCTCTG TCCATTTCAA GGGTGACTCT GTCCATTTCA AGGGTGACTC TGTCCATTTC 13 80 

AAGCCTGACT CTGTTCATTT CAAGCCTGCT TCCAGCAACC CCAAGCCCAT CACTGGCCAC 1440 

CATGTCTCTG CTGGCAGCCA CTCCAAGTCT GCCTTCAGTG CTGCCACCAG CCACCCTAAA 1500 

20 CCCATCAAGC CAGCTACCAG CCATGCTGAG CCCACCACTG CTGACTATCC CAAGCCTGCC 1560 

ACTACCAGCC ACCCTAAGCC CGCTGCTGCT GACAACCCTG AGCTCTCTGC CTCCCATTGC 1620 

CCCGAGATCC CTGCCATTGC CCACCCTGTG TCTGACGACA GTGACCTCCC TGAGTCGGCC 1680 

TCTAGCCCTG CCGCTGGGCC CACCAAGCCT GCTGCCAGCC AGCTGGAGTC TGACACCATC 1740 

GCTGACCTTC CTGACCCTAC TGTAGTCACT ACCAGTACCA ATGATTACCA TGATGTCGTG 1800 

25 GTTGTTGATG TTGAAGATGA TCCTGATGAA ATGGCTGTGT GA 1842 
(141) INFORMATION FOR SEQ ID NO: 14 0: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 613 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS : 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 140: 

5 Met Gly Pro Thr Leu Ala Val Pro Thr Pro Tyr Gly Cys lie Gly Cys 

15 10 15 

Lys Leu Pro Gin Pro Glu Tyr Pro Pro Ala Leu lie lie Phe Met Phe 
20 25 30 

Cys Ala Met Val lie Thr lie Val Val Asp Leu lie Gly Asn Ser Met 
10 35 40 45 

Val lie Leu Ala Val Thr Lys Asn Lys Lys Leu Arg Asn Ser Gly Asn 
50 55 60 

lie Phe Val Val Ser Leu Ser Val Ala Asp Met Leu Val Ala lie Tyr 
65 70 75 80 

15 Pro Tyr Pro Leu Met Leu His Ala Met Ser lie Gly Gly Trp Asp Leu 

85 90 95 

Ser Gin Leu Gin Cys Gin Met Val Gly Phe lie Thr Gly Leu Ser Val 
100 105 110 

Val Gly Ser lie Phe Asn lie Val Ala lie Ala lie Asn Arg Tyr Cys 
20 115 120 125 

Tyr lie Cys His Ser Leu Gin Tyr Glu Arg lie Phe Ser Val Arg Asn 
130 135 140 

Thr Cys lie Tyr Leu Val lie Thr Trp lie Met Thr Val Leu Ala Val 
145 150 155 160 

25 Leu Pro Asn Met Tyr lie Gly Thr lie Glu Tyr Asp Pro Arg Thr Tyr 

165 170 175 

Thr Cys lie Phe' Asn Tyr Leu Asn Asn Pro Val Phe Thr Val Thr lie 
180 185 190 

Val Cys lie His Phe Val Leu Pro Leu Leu lie Val Gly Phe Cys Tyr 
30 195 200 205 

Val Arg lie Trp Thr Lys Val Leu Ala Ala Arg Asp Pro Ala Gly Gin 
210 215 220 

Asn Pro Asp Asn Gin Leu Ala Glu Val Arg Asn Phe Leu Thr Met Phe 
225 230 235 240 



Val He Phe Leu Leu Phe Ala Val Cys Trp Cys Pro He Asn Val Leu 
245 250 255 
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Thr Val Leu Val Ala Val Ser Pro Lys Glu Met Ala Gly Lys lie Pro 
260 265 270 

Asn Trp Leu Tyr Leu Ala Ala Tyr Phe lie Ala Tyr Phe Asn Ser Cys 
275 280 285 

5 Leu Asn Ala Val lie Tyr Gly Leu Leu Asn Glu Asn Phe Arg Arg Glu 

290 295 300 

Tyr Trp Thr lie Phe His Ala Met Arg His Pro lie lie Phe Phe Pro 
305 310 315 320 

Gly Leu lie Ser Asp lie Arg Glu Met Gin Glu Ala Arg Thr Leu Ala 
10 325 330 335 

Arg Ala Arg Ala His Ala Arg Asp Gin Ala Arg Glu Gin Asp Arg Ala 
340 345 350 

His Ala Cys Pro Ala Val Glu Glu Thr Pro Met Asn Val Arg Asn Val 
355 360 365 

15 Pro Leu Pro Gly Asp Ala Ala Ala Gly His Pro Asp Arg Ala Ser Gly 

370 375 380 

His Pro Lys Pro His Ser Arg Ser Ser Ser Ala Tyr Arg Lys Ser Ala 
385 390 395 400 

Ser Thr His His Lys Ser Val Phe Ser His Ser Lys Ala Ala Ser Gly 
20 405 410 415 

P His Leu Lys Pro Val Ser Gly His Ser Lys Pro Ala Ser Gly His Pro 

'""4 420 425 430 

1^^= Lys Ser Ala Thr Val Tyr Pro Lys Pro Ala Ser Val His Phe Lys Gly 

435 440 445 

25 Asp Ser Val His Phe Lys Gly Asp Ser Val His Phe Lys Pro Asp Ser 

450 455 460 

Val His Phe Lys Pro Ala Ser Ser Asn Pro Lys Pro lie Thr Gly His 
465 470 475 480 

His Val Ser Ala Gly Ser His Ser Lys Ser Ala Phe Ser Ala Ala Thr 
30 485 490 495 

Ser His Pro Lys Pro lie Lys Pro Ala Thr Ser His Ala Glu Pro Thr 
500 505 510 

Thr Ala Asp Tyr Pro Lys Pro Ala Thr Thr Ser His Pro Lys Pro Ala 
515 520 525 

35 Ala Ala Asp Asn Pro Glu Leu Ser Ala Ser His Cys Pro Glu lie Pro 

530 535 540 

Ala lie Ala His Pro Val Ser Asp Asp Ser Asp Leu Pro Glu Ser Ala 



ill 

; -XT 

in 
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Ser Ser Pro Ala Ala Gly Pro Thr Lys Pro Ala Ala Ser Gin Leu Glu 
565 570 575 

Ser Asp Thr lie Ala Asp Leu Pro Asp Pro Thr Val Val Thr Thr Ser 
5 580 585 590 

Thr Asn Asp Tyr His Asp Val Val Val Val Asp Val Glu Asp Asp Pro 
595 600 605 

Asp Glu Met Ala Val v 
610 

10 (142) INFORMATION FOR SEQ ID NO: 141: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1842 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
15 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141: 
ATGGGGCCCA CCCTAGCGGT TCCCACCCCC TATGGCTGTA TTGGCTGTAA GCTACCCCAG 60 

CCAGAATACC CACCGGCTCT AATCATCTTT ATGTTCTGCG CGATGGTTAT CACCATCGTT 120 

20 GTAGACCTAA TCGGCAACTC CATGGTCATT TTGGCTGTGA CGAAGAACAA GAAGCTCCGG 180 

AATTCTGGCA ACATCTTCGT GGTCAGTCTC TCTGTGGCCG ATATGCTGGT GGCCATCTAC 240 

CCATACCCTT TGATGCTGCA TGCCATGTCC ATTGGGGGCT GGGATCTGAG CCAGTTACAG 300 

TGCCAGATGG TCGGGTTCAT CACAGGGCTG AGTGTGGTCG GCTCCATCTT CAACATCGTG 3 60 

GCAATCGCTA TCAACCGTTA CTGCTACATC TGCCACAGCC TCCAGTACGA ACGGATCTTC 4 20 

25 AGTGTGCGCA ATACCTGCAT CTACCTGGTC ATCACCTGGA TCATGACCGT CCTGGCTGTC 4 80 

CTGCCCAACA TGTACATTGG CACCATCGAG TACGATCCTC GCACCTACAC CTGCATCTTC 540 

AACTATCTGA ACAACCCTGT CTTCACTGTT ACCATCGTCT GCATCCACTT CGTCCTCCCT 600 

CTCCTCATCG TGGGTTTCTG CTACGTGAGG ATCTGGACCA AAGTGCTGGC GGCCCGTGAC 660 

CCTGCAGGGC AGAATCCTGA CAACCAACTT GCTGAGGTTC GCAATAAACT AACCATGTTT 72 0 

30 GTGATCTTCC TCCTCTTTGC AGTGTGCTGG TGCCCTATCA ACGTGCTCAC TGTCTTGGTG 7 80 

GCTGTCAGTC CGAAGGAGAT GGCAGGCAAG ATCCCCAACT GGCTTTATCT TGCAGCCTAC 840 
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TTCATAGCCT ACTTCAACAG CTGCCTCAAC GCTGTGATCT ACGGGCTCCT CAATGAGAAT 9 00 
TTCCGAAGAG AATACTGGAC CATCTTCCAT GCTATGCGGC ACCCTATCAT ATTCTTCTCT 960 

GGCCTCATCA GTGATATTCG TGAGATGCAG GAGGCCCGTA CCCTGGCCCG CGCCCGTGCC 102 0 

CATGCTCGCG ACCAAGCTCG TGAACAAGAC CGTGCCCATG CCTGTCCTGC TGTGGAGGAA 10 80 

5 ACCCCGATGA ATGTCCGGAA TGTTCCATTA CCTGGTGATG CTGCAGCTGG CCACCCCGAC 114 0 

CGTGCCTCTG GCCACCCTAA GCCCCATTCC AGATCCTCCT CTGCCTATCG CAAATCTGCC 12 00 

TCTACCCACC ACAAGTCTGT CTTTAGCCAC TCCAAGGCTG CCTCTGGTCA CCTCAAGCCT 12 60 

GTCTCTGGCC ACTCCAAGCC TGCCTCTGGT CACCCCAAGT CTGCCACTGT CTACCCTAAG 13 2 0 

CGTGCCTCTG TCCATTTCAA GGCTGACTCT GTCCATTTCA AGGGTGACTC TGTCCATTTC 13 80 

10 AAGCCTGACT CTGTTCATTT CAAGCCTGCT TCCAGCAACC CCAAGCCCAT CACTGGCCAC 1440 

CATGTCTCTG CTGGCAGCCA CTCCAAGTCT GCCTTCAATG CTGCCACCAG CCACCCTAAA 1500 

CCCATCAAGC CAGCTACCAG CCATGCTGAG CCCACCACTG CTGACTATCC CAAGCCTGCC 1560 

ACTACCAGCC ACCCTAAGCC CGCTGCTGCT GACAACCCTG AGCTCTCTGC CTCCCATTGC 162 0 

CCCGAGATCC CTGCCATTGC CCACCCTGTG TCTGACGACA GTGACCTCCC TGAGTCGGCC 168 0 

15 TGTAGCCCTG CCGCTGGGCC CACCAAGCCT GCTGCCAGCC AGCTGGAGTC TGACACCATC 174 0 

GCTGACCTTC CTGACCCTAC TGTAGTCACT ACCAGTACCA ATGATTACCA TGATGTCGTG 180 0 

GTTGTTGATG TTGAAGATGA TCCTGATGAA ATGGCTGTGT GA 184 2 
(14 3) INFORMATION FOR SEQ ID NO: 142: 



20 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 613 amino acids 

(B) TYPE: amino acid 
{ C ) STRANDEDNES S : 

(D) TOPOLOGY: not relevant 



(ii) MOLECULE TYPE: protein 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 142: 



Met Gly Pro Thr Leu Ala Val Pro Thr Pro Tyr Gly Cys lie Gly Cys 
15 10 15 



Lys Leu Pro Gin Pro Glu Tyr Pro Pro Ala Leu lie lie Phe Met Phe 
20 25 30 



30 



Cys Ala Met Val lie Thr lie Val Val Asp Leu lie Gly Asn Ser Met 
35 40 45 



'0 

ifi 

m 
m 
m 
III 



in 
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Val He Leu Ala Val Thr Lys Asn Lys Lys Leu Arg Asn Ser Gly Asn 
50 55 60 

He Phe Val Val Ser Leu Ser Val Ala Asp Met Leu Val Ala He Tyr 
65 70 75 80 

5 Pro Tyr Pro Leu Met Leu His Ala Met Ser He Gly Gly Trp Asp Leu 

85 90 95 

Ser Gin Leu Gin Cys Gin Met Val Gly Phe He Thr Gly Leu Ser Val 
100 105 110 

Val Gly Ser He Phe Asn He Val Ala He Ala He Asn Arg Tyr Cys 
10 115 120 125 

Tyr He Cys His Ser Leu Gin Tyr Glu Arg He Phe Ser Val Arg Asn 
130 135 140 

Thr Cys He Tyr Leu Val He Thr Trp He Met Thr Val Leu Ala Val 
145 150 155 160 

15 Leu Pro Asn Met Tyr He Gly Thr He Glu Tyr Asp Pro Arg Thr Tyr 

165 170 175 

Thr Cys He Phe Asn Tyr Leu Asn Asn Pro Val Phe Thr Val Thr He 
180 185 190 

Val Cys He His Phe Val Leu Pro Leu Leu He Val Gly Phe Cys Tyr 
20 195 200 205 



P Val Arg He Trp Thr Lys Val Leu Ala Ala Arg Asp Pro Ala Gly Gin 

'Sf 210 215 220 

1^ Asn Pro Asp Asn Gin Leu Ala Glu Val Arg Asn Lys Leu Thr Met Phe 

225 230 235 240 

25 Val He Phe Leu Leu Phe Ala Val Cys Trp Cys Pro He Asn Val Leu 

245 250 255 

Thr Val Leu Val Ala Val Ser Pro Lys Glu Met Ala Gly Lys He Pro 
260 265 270 

Asn Trp Leu Tyr Leu Ala Ala Tyr Phe He Ala Tyr Phe Asn Ser Cys 
30 275 280 285 

Leu Asn Ala Val He Tyr Gly Leu Leu Asn Glu Asn Phe Arg Arg Glu 
290 295 300 

Tyr Trp Thr He Phe His Ala Met Arg His Pro He He Phe Phe Ser 
305 310 315 320 

35 Gly Leu He Ser Asp He Arg Glu Met Gin Glu Ala Arg Thr Leu Ala 

325 330 335 



Arg Ala Arg Ala His Ala Arg Asp Gin Ala Arg Glu Gin Asp Arg Ala 
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340 345 350 

His Ala Cys Pro Ala Val Glu Glu Thr Pro Met Asn Val Arg Asn Val 
355 360 365 

Pro Leu Pro Gly Asp Ala Ala Ala Gly His Pro Asp Arg Ala Ser Gly 
5 370 375 380 

His Pro Lys Pro His Ser Arg Ser Ser Ser Ala Tyr Arg Lys Ser Ala 
385 390 395 400 

Ser Thr His His Lys Ser Val Phe Ser His Ser Lys Ala Ala Ser Gly 
405 410 415 

10 His Leu Lys Pro Val Ser Gly His Ser Lys Pro Ala Ser Gly His Pro 

420 425 430 

Lys Ser Ala Thr Val Tyr Pro Lys Pro Ala Ser Val His Phe Lys Ala 
435 440 445 

Asp Ser Val His Phe Lys Gly Asp Ser Val His Phe Lys Pro Asp Ser 
15 450 455 460 

Val His Phe Lys Pro Ala Ser Ser Asn Pro Lys Pro lie Thr Gly His 
465 470 475 480 

His Val Ser Ala Gly Ser His Ser Lys Ser Ala Phe Asn Ala Ala Thr 
485 490 495 

20 Ser His Pro Lys Pro lie Lys Pro Ala Thr Ser His Ala Glu Pro Thr 

500 505 510 

Thr Ala Asp Tyr Pro Lys Pro Ala Thr Thr Ser His Pro Lys Pro Ala 
515 520 525 

Ala Ala Asp Asn Pro Glu Leu Ser Ala Ser His Cys Pro Glu lie Pro 
25 530 535 540 

Ala lie Ala His Pro Val Ser Asp Asp Ser Asp Leu Pro Glu Ser Ala 
545 550 555 560 

Ser Ser Pro Ala Ala Gly Pro Thr Lys Pro Ala Ala Ser Gin Leu Glu 
565 570 575 

30 Ser Asp Thr lie Ala Asp Leu Pro Asp Pro Thr Val Val Thr Thr Ser 

580 585 590 

Thr Asn Asp Tyr His Asp Val Val Val Val Asp Val Glu Asp Asp Pro 
595 600 605 

Asp Glu Met Ala Val 
35 610 



(144) INFORMATION FOR SEQ ID NO: 143: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143 
GCTGAGGTTC GCAATAAACT AACCATGTTT GTG 
(145) INFORMATION FOR SEQ ID NO: 144: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 144: 
CTCCTTCGGT CCTCCTATCG TTGTCAGAAG T 
(146) INFORMATION FOR SEQ ID NO:145: 

SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
25 (iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 5: 
TTAGATATCG GGGCCCACCC TAGCGGT 
(147) INFORMATION FOR SEQ ID NO: 146: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(i) 



20 
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(iv) ANTI- SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 146: 
GGTACCCCCA CAGCCATTTC ATCAGGATC 33 



